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(54) Title: MOLECULAR ANTIGEN ARRAY 

(57) Abstract: The invention provides compositions and processes for the production of ordered and repetitive antigen or antigenic 
determinant arrays. The compositions of the invention are useful for the production of vaccines for the prevention of infectious 
diseases, the treatment of allergies and the treatment of cancers. Various embodiments of the invention provide for a core particle 
that is coated with any desired antigen in a highly ordered and repetitive fashion as the result of specific interactions. 
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MOLECULAR ANTIGEN ARRAY 

BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] The present invention is related to the fields of molecular biology, 

virology, immunology and medicine. The invention provides a composition 
comprising an ordered and repetitive antigen or antigenic determinant array. The 
invention also provides a process for producing an antigen or antigenic 
determinant in an ordered and repetitive array. The ordered and repetitive antigen 
or antigenic determinant is useful in the production of vaccines for the treatment 
of infectious diseases, the treatment of allergies and as a pharmaccine to prevent 
or cure cancer and to generate defined self-specific antibodies and specific immune 
responses of the Th2 type. 

Background Art 

[0002] Vaccine development for the prevention of infectious disease has had the 

greatest impact on human health of any medical invention. It is estimated that 
three million deaths per year are prevented worldwide by vaccination (Hillemann, 
Nature Medicine 4:507 (1998)). The most common vaccination strategy, the use 
of attenuated (i. e. , less virulent) pathogens or closely related organisms, was first 
demonstrated by Edward Jenner in 1796, who vaccinated against smallpox by the 
administration of a less dangerous cowpox virus. Although a number of live 
attenuated viruses (e.g., measles, mumps, rubella, varicella, adenovirus, polio, 
influenza) and bacteria (e.g., bacille Calmette-Guerin (BCG) against tuberculosis) 
are successfully administered for vaccination, there is a risk for the development 
of serious complications related to a reversion to virulence and infection by the 
'vaccine' organism, in particular in immunocompromised individuals. 

[0003] The specific design of attenuated viruses is now enabled by recombinant 

DNA technology (i.e. , genetic engineering) through the generation of deletion or 
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mutation variants. For example, the administration of an engineered Simian 
Immunodeficiency Virus (SIV) with a deletion within the nef gene was shown to 
protect macaques from subsequent infection with a pathogenic SIV strain (Daniel 
et al, Science 255:1938-1941 (1992)). However, the progression of acquired 
immunodeficiency syndrome (AIDS)-like symptoms in animals administered 
attenuated SIV raises safety concerns (Baba et a!., Science 267:1820-1825 
(1995)). 

[0004] As an alternative approach, attenuated viruses or bacteria may be used as 

carriers for the antigen-encoding genes of a pathogen that is considered too unsafe 
to be administered in an attenuated form (e.g., Human Immunodeficiency Virus 
(HIV)). Upon delivery of the antigen-encoding gene to the host, the antigen is 
synthesized in situ. Vaccinia and related avipox viruses have been used as such 
carriers for various genes in preclinical and clinical studies for a variety of diseases 
(e.g., Shen etal, Science 252:440 (1991)). One disadvantage of this vaccination 
strategy is that it does not mimic the virion surface, because the recombinant 
protein is expressed on the surface of the host cell. Additionally, complications 
may develop in immunocompromised individuals, as evidenced by life-threatening 
disseminated vaccinia infections (Redfield, N. Eng. J. Med. 316:673 (1998)). 

[0005] A fourth vaccination approach involves the use of isolated components of 

a pathogen, either purified from the pathogen grown in vitro (e.g., influenza 
hemagglutinin or neuraminidase) or after heterologous expression of a single viral 
protein (e.g., Hepatitis B surface antigen). For example, recombinant, mutated 
toxins (detoxified) are used for vaccination against diphtheria, tetanus, cholera 
and pertussis toxins (Levine et al, New generation vaccines, 2nd edn., Marcel 
Dekker, Inc., New York 1 997), and recombinant proteins of HIV (gp 1 20 and full- 
length gp 1 60) were evaluated as a means to induce neutralizing antibodies against 
HIV with disappointing results (Connor et a!. 9 J. Virol 72:1552 (1998)). 
Recently, promising results were obtained with soluble oligomeric gp 1 60, that can 
induce CTL response and elicit antibodies with neutralizing activity against HIV- 1 
isolates (Van Cortte* at, J. Virol 71:4319(1997)). In addition, peptide vaccines 
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may be used in which known B~ or T-cell epitopes of an antigen are coupled to 
a carrier molecule designed to increase the immunogenicity of the epitope by 
stimulating T-cell help. However, one significant problem with this approach is 
that it provides a limited immune response to the protein as a whole. Moreover, 
vaccines have to be individually designed for different MHC haplotypes. The 
most serious concern for this type of vaccine is that protective antiviral antibodies 
recognize complex, three-dimensional structures that cannot be mimicked by 
peptides. 

[0006] A more novel vaccination strategy is the use of DNA vaccines (Donnelly 

et aL, Ann. Rev. Immunol. 75:617 (1997)), which may generate MHC Class I- 
restricted CTL responses (without the use of a live vector). This may provide 
broader protection against different strains of a virus by targeting epitopes from 
conserved internal proteins pertinent to many strains of the same virus. Since the 
antigen is produced with mammalian post-translational modification, conformation 
and oligomerization, it is more likely to be similar or identical to the wild-type 
protein produced by viral infection than recombinant or chemically modified 
proteins. However, this distinction may turn out to be a disadvantage for the 
application of bacterial antigens, since non-native post-translational modification 
may result in reduced immunogenicity. In addition, viral surface proteins are not 
highly organized in the absence of matrix proteins. 

[0007] In addition to applications for the prevention of infectious disease, vaccine 

technology is now being utilized to address immune problems associated with 
allergies. In allergic individuals, antibodies of the IgE isotype are produced in an 
inappropriate humoral immune response towards particular antigens (allergens). 
The treatment of allergies by allergy immunotherapy requires weekly 
administration of successively increasing doses of the particular allergen over a 
period of up to 3-5 years. Presumably, 'blocking 3 IgG antibodies are generated 
that intercept allergens in nasal or respiratory secretions or in membranes before 
they react with IgE antibodies on mast cells. However, no constant relationship 
exists between IgG titers and symptom relief. Presently, this is an extremely time- 
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and cost-consuming process, to be considered only for patients with severe 
symptoms over an extended period each year. 

[0008] It is well established that the administration of purified proteins alone is 

usually not sufficient to elicit a strong immune response; isolated antigen generally 
must be given together with helper substances called adjuvants. Within these 
adjuvants, the administered antigen is protected against rapid degradation, and the 
adjuvant provides an extended release of a low level of antigen. 

[0009] Unlike isolated proteins, viruses induce prompt and efficient immune 

responses in the absence of any adjuvants both with and without T-cell help 
(Bachmann & Zinkernagel, Ann. Rev. Immunol 15:235-270 (1997)). Although 
viruses often consist of few proteins, they are able to trigger much stronger 
immune responses than their isolated components. For B cell responses, it is 
known that one crucial factor for the immunogenicity of viruses is the 
repetitiveness and order of surface epitopes. Many viruses exhibit a quasi- 
crystalline surface that displays a regular array of epitopes which efficiently 
crosslinks epitope-specific immunoglobulins onB cells (Bachmann & Zinkernagel, 
Immunol Today 77:553-558 (1996)). This crosslinking of surface 
immunoglobulins on B cells is a strong activation signal that directly induces cell- 
cycle progression and the production of IgM antibodies. Further, such triggered 
B cells are able to activate T helper cells, which in turn induce a switch from IgM 
to IgG antibody production in B cells and the generation of long-lived B cell 
memory - the goal of any vaccination (Bachmann & Zinkernagel, Ann. Rev. 
Immunol 75:235-270 (1997)). Viral structure is even linked to the generation of 
anti-antibodies in autoimmune disease and as a part of the natural response to 
pathogens {see Fehr, T., et al, J. Exp. Med 785:1785-1792 (1997)). Thus, 
antigens on viral particles that are organized in an ordered and repetitive array are 
highly immunogenic since they can directly activate B cells. 

[0010] In addition to strong B cell responses, viral particles are also able to induce 

the generation of a cytotoxic T cell response, another crucial arm of the immune 
system. These cytotoxic T cells are particularly important for the elimination of 
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non-cytopathic viruses such as HIV or Hepatitis B virus and for the eradication 
of tumors. Cytotoxic T cells do not recognize native antigens but rather recognize 
their degradation products in association with MHC class I molecules (Townsend 
& Bodmer, Ann. Rev. Immunol 7:601-624 (1989)). Macrophages and dendritic 
cells are able to take up and process exogenous viral particles (but not their 
soluble, isolated components) and present the generated degradation product to 
cytotoxic T cells, leading to their activation and proliferation (Kovacsovics- 
Bankowski etaL, Proc. Natl. Acad. Sci. USA 90:4942-4946 (1993); Bachmann 
etal, Eur. J. Immunol 26:2595-2600 (1996)). 
[0011] Viral particles as antigens exhibit two advantages over their isolated 

components: (1) Due to their highly repetitive surface structure, they are able to 
directly activate B cells, leading to high antibody titers and long-lasting B cell 
memory; and (2) Viral particles but not soluble proteins are able to induce a 
cytotoxic T cell response, even if the viruses are non-infectious and adjuvants are 
absent. 

[0012] Several new vaccine strategies exploit the inherent immunogenicity of 

viruses. Some of these approaches focus on the particulate nature of the virus 
particle; for example see Harding, C. V. and Song, R., (J. Immunology 153:4925 
(1994)), which discloses a vaccine consisting of latex beads and antigen; 
Kovacsovics-Bankowski, M., et al (Proc. Natl Acad. Sci. USA 90:4942-4946 
(1 993)), which discloses a vaccine consisting of iron oxide beads and antigen; U. S . 
Patent No 5,334,394 to Kossovsky, N., et al, which discloses core particles 
coated with antigen; U.S. Patent No. 5,871,747, which discloses synthetic 
polymer particles carrying on the surface one or more proteins covalently bonded 
thereto; and a core particle with a non-covalently bound coating, which at least 
partially covers the surface of said core particle, and at least one biologically 
active agent in contact with said coated core particle {see, e.g., WO 94/15585). 

[0013] However, a disadvantage of these viral mimicry systems is that they are 

not able to recreate the ordered presentation of antigen found on the viral surface. 
Antigens coupled to a surface in a random orientation are found to induce CTL 
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response and no or only weak B-cell response. For an efficient vaccine, both arms 
of the immune system have to be strongly activated, as described above and in 
Bachmann & ZinkernageL, Ann. Rev. Immunol J 5:235 (1997). 

[0014] In another example, recombinant viruses are being utilized for antigen 

delivery. Filamentous phage virus containing an antigen fused to a capsid protein 
has been found to be highly immunogenic (see Perham R.N., et al, FEMS 
Microbiol Rev. 77:25-31 (1995); Willis et al, Gene 725:85-88 (1993); 
Minenkova et al, Gene 725:85-88 (1993)). However, this system is limited to 
very small peptides (5 or 6 amino acid residues) when the fusion protein is 
expressed at a high level (Iannolo et al, J. Mol Biol 245:835-844 (1995)) or 
limited to the low level expression of larger proteins (de la Cruz et al, J. Biol 
Chem. 255:4318-4322(1988)). For small peptides, so far only the CTL response 
is observed and no or only weak B-cell response. , 

[0015] In yet another system, recombinant alphaviruses are proposed as a means 

of antigen delivery (see U.S. Patent Nos. 5,766,602; 5,792,462; 5,739,026; 
5;789,245 and 5,814,482). Problems with the recombinant virus systems 
described so far include a low density expression of the heterologous protein on 
the viral surface and/or the difficulty of successfully and repeatedly creating a new 
and different recombinant viruses for different applications. 

[0016] In a further development, virus-like particles (VLPs) are being exploited 

in the area of vaccine production because of both their structural properties and 
their non-infectious nature. VLPs are supermolecular structures built in a 
symmetric manner from many protein molecules of one or more types. They lack 
the viral genome and, therefore, are noninfectious. VLPs can often be produced 
in large quantities by heterologous expression and can be easily be purified. 

[0017] Examples of VLPs include the capsid proteins of Hepatitis B virus (Ulrich, 

* et al, Virus Res. 50:141-182 (1998)), measles virus (Warnes, et al, Gene 
750:173-178 (1995)), Sindbis virus, rotavirus (U.S. Patent Nos. 5,071,651 and 
5,374,426), foot-and-mouth-disease virus (Twomey, et al, Vaccine 
73:1603-1610, (1995)), Norwalk virus (Jiang, X., etaL, Science 250:1580-1583 
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(1990); Matsui, S.M., etal, J. Clin. Invest 57:1456-1461 (1991)), the retroviral 
GAG protein (PCT Patent Appl. No. WO 96/30523), the retrotransposon Ty 
protein pi, the surface protein of Hepatitis B virus (WO 92/11291) and human 
papilloma virus (WO 98/15631). In some instances, recombinant DNA 
technology may be utilized to fuse a heterologous protein to a VLP protein 
(Kratz, P.A., etal., Proc. Natl Acad. Set USA 96: 19151920 (1999)). 
[0018] Thus, there is a need in the art for the development of new and improved 

vaccines that promote a strong CTL and B-cell immune response as efficiently as 
natural pathogens. 

BRIEF SUMMARY OF THE INVENTION 

[0019] The invention provides a versatile new technology that allows production 

of particles or pili coated with any desired antigen. The technology allows the 
creation of highly efficient vaccines against infectious diseases and for the creation 
of vaccines for the treatment of allergies and cancers. The invention also provides 
compositions suited for the induction of Th type 2 T-helper cells (Th2 cells). 
Thus, efficient vaccines for the treatment of chronic diseases induced or 
accelerated by a Thl type immune response, such as arthritis, colitis, diabetes and 
multiple sclerosis can be produced with the technology provided by this invention. 

[0020] In a first embodiment, the invention provides a novel composition 

comprising (A) a non-natural molecular scaffold and (B) an antigen or antigenic 
determinant. 

[0021] The non-natural molecular scaffold comprises, or alternatively consists of, 

(i) a core particle selected from the group consisting of (1) a core particle of non- 
natural origin and (2) a core particle of natural origin; and (ii) an organizer 
comprising at least one first attachment site, wherein said organizer is connected 
to said core particle by at least one covalent bond. 

[0022] In certain specific embodiments, the core particle naturally contains an 

organizer. One example of an embodiment of the invention where the organizer 
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is naturally occurring is the bacterial pilus or pilin protein. The antigenic 
determinant may be linked by a cysteine to a naturally occurring lysine residue of 
the bacterial pili or pilin protein. 

[0023] The antigen or antigenic determinant has at least one second attachment 

site which is selected from the group consisting of (i) an attachment site not 
naturally occurring with said antigen or antigenic determinant; and (ii) an 
attachment site naturally occurring with said antigen or antigenic determinant. 

[0024] The invention provides for an ordered and repetitive antigen array through 

an association of the second attachment site to the first attachment site by way of 
at least one non-peptide bond. Thus, the antigen or antigenic determinant and the 
non-natural molecular scaffold are brought together through this association of the 
first and the second attachment site to form an ordered and repetitive antigen 
array. 

[0025] In another embodiment, the core particle of the aforementioned 

composition comprises a virus, a virus-like particle, a bacterial pilus, a structure 
formed from bacterial pilin, a bacteriophage, a viral capsid particle or a 
recombinant form thereof. Alternatively, the core particle may be a synthetic 
polymer or a metal. 

[0026] In yet another embodiment, the core particle comprises, or alternatively 

consists of, one or more different Hepatitis core (capsid) proteins (HBcAgs). In 
a related embodiment, one or more cysteine residues of these HBcAgs are either 
deleted or substituted with another amino acid residue (e.g., a serine residue). In 
a specific embodiment, the cysteine residues of the HBcAg used to prepare 
compositions of the invention which correspond to amino acid residues 48 and 
107 in SEQ ID NO: 134 are either deleted or substituted with another amino acid 
residue (e.g., a serine residue). 

[0027] Further, the HBc Ag variants used to prepare compositions of the invention 

will generally be variants which retain the ability to associate with other HBcAgs 
to form dimeric or multimeric structures that present ordered and repetitive 
antigen or antigenic determinant arrays. 
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[0028] In another embodiment, the non-natural molecular scaffold comprises, or 

alternatively consists of, pili or pilus-like structures that have been either produced 
from pilin proteins or harvested from bacteria. When pili or pilus-like structures 
are used to prepare compositions of the invention, they may be formed from 
products of pilin genes which are naturally resident in the bacterial cells but have 
been modified by genetically engineered (e.g., by homologous recombination) or 
pilin genes which have been introduced into these cells. 

[0029] In a related embodiment, the core particle comprises, or alternatively 

consists of, pili or pilus-like structures that have been either prepared from pilin 
proteins or harvested from bacteria. These core particles may be formed from 
products of pilin genes naturally resident in the bacterial cells. Further, antigens 
or antigenic determinants may be linked to these core particles naturally containing 
an organizer. In such a case, the core particles will generally be linked to a second 
attachment site of the antigen or antigenic determinant. In most embodiments of 
the invention, the pili or pilus-like structures will be able to form an ordered and 
repetitive antigen array with the antigen or antigenic determinant linked to the 
core particle at a specific or preferred location (e.g., a specific amino acid 
residue). 

[0030] In a particular embodiment, the organizer may comprise at least one first 

attachment site. The first and the second attachment sites are particularly 
important elements of compositions of the invention. In various embodiments of 
the invention, the first and/or the second attachment site may be an antigen and an 
antibody or antibody fragment thereto; biotin and avidin; strepavidin and biotin; 
a receptor and its ligand; a ligand-binding protein and its ligand; interacting leucine 
zipper polypeptides; an amino group and a chemical group reactive thereto; a 
carboxyl group and a chemical group reactive thereto; a sulfhydryl group and a 
chemical group reactive thereto; or a combination thereof. 

[0031] In one embodiment, the invention provides the coupling of almost any 

antigen of choice to the surface of a virus, bacterial pilus, structure formed from 
bacterial pilin, bacteriophage, virus-like particle or viral capsid particle. By 
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bringing an antigen into a quasi-crystalline 'virus-like 5 structure, the invention 
exploits the strong antiviral immune reaction of a host for the production of a 
highly efficient immune response, i. e. 9 a vaccination, against the displayed antigen. 
[0032] In another embodiment, the core particle may be selected from the group 

consisting of: recombinant proteins of Rotavirus, recombinant proteins of 
Norwalk virus, recombinant proteins of Alphavirus, recombinant proteins of Foot 
and Mouth Disease virus, recombinant proteins of Retrovirus, recombinant 
proteins of Hepatitis B virus, recombinant proteins of Tobacco mosaic virus, 
recombinant proteins of Flock House Virus, and recombinant proteins of human 
Papilomavirus. 

[0033] In yet another embodiment, the antigen may be selected from the group 

consisting of: (1) a protein suited to induce an immune response against cancer 
cells; (2) a protein suited to induce an immune response against infectious 
diseases; (3) a protein suited to induce an immune response against allergens; and 
(4) a protein suited to induce an immune response in pets or farm animals. 

[0034] In one embodiment, the invention relates to the induction of specific Th 

type 2 T-helper cells (Th2 cells) using antigens attached to Pili. The induction of 
Th2 responses may be beneficial for the treatment of a number of diseases. For 
example, many chronic diseases in humans an animals, such as arthritis, colitis, 
diabetes and multiple sclerosis are dominated by Thl response, where T cells 
secrete IFN y and other pro-inflammatory cytokines precipitating disease. 

[0035] In a particularly embodiment of the invention, the first attachment site 

and/or the second attachment site comprise an interacting leucine zipper 
polypeptide. In a related embodiment, the first attachment site and/or the second 
attachment site are selected from the group comprising: (1) the JUN leucine 
zipper protein domain; and (2) the FOS leucine zipper protein domain. 

[0036] In another embodiment, the first attachment site and/or the second 

attachment site are selected from the group comprising: (1) a genetically 
engineered lysine residue and (2) a genetically engineered cysteine residue, two 
residues that may be chemically linked together. 
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[0037] The invention also includes embodiments where the organizer particle has 

only a single first attachment site and the antigen or antigenic determinant has only 
a single second attachment site. Thus, when an ordered and repetitive antigen 
array is prepared using such embodiments, each organizer will be bound to a 
single antigen or antigenic determinant. 

[0038] In one aspect, the invention provides compositions comprising, or 

alternatively consisting of, (a) a non-natural molecular scaffold comprising (i) a 
core particle selected from the group consisting of a core particle of non-natural 
origin and a core particle of natural origin, and (ii) an organizer comprising at least 
one first attachment site, wherein the core particle comprises, or alternatively 
consists of, a bacterial pilus, a pilus-like structure, or a modified HBcAg, or 
fragment thereof, and wherein the organizer is connected to the core particle by 
at least one covalent bond, and (b) an antigen or antigenic determinant with at 
least one second attachment site, the second attachment site being selected from 
the group consisting of (i) an attachment site not naturally occurring with the 
antigen or antigenic determinant and (ii) an attachment site naturally occurring 
with the antigen or antigenic determinant, wherein the second attachment site is 
capable of association through at least one non-peptide bond to the first 
attachment site, and wherein the antigen or antigenic determinant and the scaffold 
interact through the association to form an ordered and repetitive antigen array. 

[0039] Other embodiments of the invention include processes for the production 

of compositions of the invention and a methods of medical treatment using vaccine 
compositions described herein. 

[0040] It is to be understood that both the foregoing general description and the 

following detailed description are exemplary and explanatory only and are 
intended to provide further explanation of the invention as claimed. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0041] Figure 1 shows a Western blot demonstrating the production of viral 

particles containing the E2-JUN fusion protein using the pCYTts::EZ/£/7V 
expression vector. 

[0042] Figure 2 shows a Western blot demonstrating the production of viral 

particles containing the E2-JUN fusion protein expressed from pTE5'2J::E2/£/iV 
expression vector. 

[0043] Figure 3 shows a Western dot blot demonstrating bacterial and eukaryotic 

expression of the FOS-hgh antigen. 
[0044] Figure 4 shows the expression of HBcAg-JUN in E. coli cells. 

[0045] Figure 5 shows a Western blot demonstrating that HBcAg-JUN is soluble 

inE. coli ly sates. 

[0046] Figure 6 shows an SDS-PAGE analysis of enrichment of HBcAg-JUN 

capsid particles on a sucrose density gradient. 
[0047] Figure 7 shows a non-reducing SDS-PAGE analysis of the coupling of 

hGH-FOS and HBcAg-JUN particles. 
[0048] Figure 8 depicts an analysis by SDS-PAGE of the coupling reaction of the 

FLAG peptide to HBcAG-Lys treated with iodacetamide and activated with 

Sulfo-MB S . The excess of cross-linker and of peptide over HBcAg-Lys monomer 

is indicated below the figure. 
[0049] Figure 9 depicts an analysis of coupling of the FLAG peptide to type-1 

bacterial pili by SDS-PAGE. Lane 1 shows the unreacted pili subunit FimA. Lane 

3 shows the purified reaction mixture of the pili with the FLAG peptide. The 

upper band corresponds to the coupled product, while the lower band corresponds 

to the unreached subunit. 
[0050] Figure 10 depicts an analysis by SDS-PAGE of the derivatization of 

HBcAg-Lys with SPDP. 
[0051] Figure 11 depicts an analysis by SDS-PAGE of the derivatization of 

HBcAg-Lys with Sulfo-MBS. 
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[0052] Figure 12 depicts an analysis by SDS-PAGE of the coupling of HBcAg- 

Lys-2cyc-Mut to the FLAG peptide. The arrow shows the bands corresponding 
to the coupling of one and two FLAG peptides, respectively, to one subunit of 
HBcAgLys-2cyc-Mut. Lane M corresponds to the marker, lane 1 to the 
unreached HBcAg~Lys-2cyc-Mut, lane 2 to HBcAg-Lys-2cyc-Mut activated with 
Sulfo-MBS, and lane 3 activated HBcAg-Lys-2cyc-Mut after reaction with the 
FLAG peptide containing an TM-terminal cysteine. 

[0053] Figure 13 depicts an analysis by SDS-PAGE of the coupling of pili to the 

p33 peptide. 

[0054] Figure 14A shows an analysis of coupling of DPI 78c peptide by SDS- 

PAGE analysis and Coomassie staining. Lane 1 corresponds to the supernatant of 
the coupling reaction after centrifiigation, while lane 2 corresponds to the pellet. 
Figure 14B show an ELISA data and subtype analysis of mice, sera immunized 
with Pili-DP 1 78c. The OD (450 nm) of the ELISA signal obtained at a fifty-fold 
dilution of the sera is shown in the diagram. For each subtype determination, mice 
sera were titrated from a fifty-fold dilution in two-fold dilution steps. The ELISA 
titer of the IgGl subtype (OD50 dilution) was 1:400, while the titer of the IgG2b 
subtype was 1 : 100. The other subtypes all had titers inferior to 1 :50. The IgG 
isotype pattern is characteristic of a Th2 response, with a high IgGl titer and a 
low IgG2a titer. 

[0055] Figure 1 5 A shows an analysis of Coupling of GRA2 to Pili by SDS-PAGE 

analysis and Coomassie staining. Figure 1 5B relates to immunization of mice with 
Pili-GRA2 and IgG subtype determination. Depicted is an analysis of total IgG 
titer and IgG subtype titers by ELISA. The ELISA titer is given by the dilution 
of sera at which OD50 is obtained. The result of the immunization of two 
individual mice is shown. A high IgGl titer and a low IgG2a titer is characteristic 
of a Th2 response. 

[0056] Figure 16 A shows an analysis of coupling of B2 and D2 peptides to Pili 

by SDS-PAGE analysis and Coomassie staining. Figure 16B relates to 
immunization of mice with Pili-B2 and IgG subtype determination. The OD (450 
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nm) of the ELIS A signal obtained at a fifty-fold dilution of the sera is shown in the 
diagram. For each subtype determination, mice sera were titrated from a fifty-fold 
dilution in two-fold dilution steps. The titer of the IgGl subtype (dilution at 
which the signal corresponds to OD 50) wasl : 250, while the other subtypes all 
had titers inferior to 1 :50. The titer of the IgGl subtype is much higher than the 
titer of the IgG2a subtype, a pattern typical for a Th2 response. 

[0057] Figure 17 relates to the measurement of antibodies specific for TNFa 

protein in the serum of mice immunized with the muTNFa peptide coupled to 
type-1 Pili. As a control, preimmune sera of two mice were assayed for binding 
to TNFa protein. Sera were added at three different dilutions (1:50, 1:100 and 
1 :200), and bound IgG was detected with a horseradish peroxidase-conjugated 
anti-murine IgG antibody. Results from four individual mice are shown on day 2 1 
and day 43. OD (450 nm): optical density at 450 nm. 

[0058] Figure 18 A shows an analysis of coupling of S'-TNF II and 3'-TNF II by 

SDS-PAGE and Coomassie staining. Lane M is the marker lane. Untreated Pili 
were loaded on lane 1, Pili-5'-TNF II before dialysis on lane 2, Pili-3'-TNF II 
before dialysis on lane 3, Pili-5'-TNF II after dialysis on lane 4, pili-3'-TNF II after 
dialysis on lane 5. The arrow indicates the size at which the coupled product 
migrates. 

[0059] Figure 1 SB shows an ELIS A analysis of sera of mice immunized with Pili- 

5'-TNF II and Pili-3'-TNF II: Anti-TNFa ELISA IgG antibodies specific for 
native TNFa protein were measured in a specific ELISA. 2 //g/ml native TNFa 
protein was coated on ELISA plates. Sera were added at different dilutions and 
bound IgG was detected with a horseradish peroxidase-conjugated anti-murine 
IgG antibody. Results from four individual mice are shown on day 21 and day 43 
OD (450 nm): optical density at 450 nm. The data show that mice immunized 
with the TNF peptides coupled to pili mount an antibody response against native 
TNFa protein, thus breaking self-tolerance. 

[0060] Figure 1 8C shows an ELISA analysis of sera of mice immunized with Pili- 

5'-TNF II and Pili-3'-TNF II: Anti-TNFa peptide ELISA. IgG antibodies specific 
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for the 5TNF II and 3'TNF II peptides were measured in a specific ELISA: 10 
jUg/ml Ribonuclease A coupled to 5'TNF II or 3TNF II peptide was coated on 
ELISA plates. Sera were added at different dilutions and bound IgG was detected 
with a horseradish peroxidaseconjugated anti-murine IgG antibody. Results from 
four individual mice are shown on day 21 . 

[0061] Figure 18D shows that IgG subtype analysis of anti-TNF peptide 

antibodies in mice vaccinated with the corresponding TNF-peptides coupled to 
Pili. Results from four individual mice (no. 1-4) are shown on day 50. ELISA 
titer; dilution step at which half-maximal optical density was reached (-log 2 of 
40-fold prediluted sera). The high IgGl titer obtained as compared to the very 
low IgG2a titer is typical of a Th2 response. 

[0062] Figure 19A shows an analysis of coupling of M2 peptide to Pili by SDS- 

PAGE analysis and Coomassie staining. The bands corresponding to non-coupled 
Pili and to the coupling product, Pili-M2, are indicated by arrows. Figure 19B 
shows an ELISA analysis and IgG subtype determination of mice vaccinated with 
Pili-M2. Sera were diluted eighty-fold, and titrated down in two-fold dilution 
steps. For the IgGl subtype, a titer of 1 :2560 was obtained, while for the IgG2a 
and IgG2b subtypes, titers below 1:100 were obtained. The titer for the IgG3 
subtype was below 1 :80. Titers were calculated as the serum dilution resulting in 
half-maximal optical density (OD 50 ). A strong IgGl titer in conjunction with a low 
IgG2a titer is characteristic for a Th2 type response. Average results from two 
mice are shown as optical densities obtained with a 1:80 dilution of the serum. 

[0063] Figure 20 shows an ELISA analysis and IgG subtype determination of sera 

from mice immunized with HBcAg-Lys-2cys-Mut coupled to the Flag peptide. 
Ribonuclease A coupled to Flag peptide was coated at 10 //g/ml, and serum was 
added at a 1 :40 dilution. In contrast to experiments where mice were immunized 
with antigens coupled to Pili, there is no predominance of the IgGl subtype over 
the other IgG subtypes. 
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DET AILED DESCRIPTION OF THE INVENTION 
1 . Definitions 

[0064] The following definitions are provided to clarify the subject matter which 

the inventors consider to be the present invention. 

[0065] Alphavirus: As used herein, the term "alphavirus" refers to any of the RNA 

viruses included within the genus Alphavirus. Descriptions of the members of this 
genus are contained in Strauss and Strauss, Microbiol Rev. s 55:491-562 (1994). 
Examples of alphaviruses include Aura virus, Bebaru virus, Cabassou virus, 
Chikungunya virus, Easter equine encephalomyelitis virus, Fort morgan virus, 
Getah virus, Kyzylagach virus, Mayoaro virus, Middleburg virus, Mucambo virus, 
Ndumu virus, Pixuna virus, Tonate virus, Triniti virus, Una virus, Western equine 
encephalomyelitis virus, Whataroa virus, Sindbis virus (SIN), Semliki forest virus 
(SFV), Venezuelan equine encephalomyelitis virus (VEE), and Ross River virus. 

[0066] Antigen: As used herein, the term "antigen" is a molecule capable of being 

bound by an antibody. An antigen is additionally capable of inducing a humoral 
immune response and/or cellular immune response leading to the production of B- 
and/or t-lymphocytes. An antigen may have one or more epitopes (B- and T- 
epitopes). The specific reaction referred to above is meant to indicate that the 
antigen will react, in a highly selective manner, with its corresponding antibody 
and not with the multitude of other antibodies which may be evoked by other 
antigens. 

[0067] Antigenic determinant: As used herein, the term" antigenic determinant" 

is meant to refer to that portion of an antigen that is specifically recognized by 
either B- or T-lymphocytes. B-lymphocytes respond to foreign antigenic 
determinants via antibody production, whereas T-lymphocytes are the mediator 
of cellular immunity. Thus, antigenic determinants or epitopes are those parts of 
an antigen that are recognized by antibodies, or in the context of an MHC, by T- 
cell receptors. 
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[0068] Association: As used herein, the term "association 1 ' as it applies to the first 

and second attachment sites, is used to refer to at least one non-peptide bond. 
The nature of the association may be covalent, ionic, hydrophobic, polar or any 
combination thereof. 

[0069] Attachment Site, First: As used herein, the phrase "first attachment site" 

refers to an element of the "organizer", itself bound to the core particle in a non- 
random fashion, to which the second attachment site located on the antigen or 
antigenic determinant may associate. The first attachment site may be a protein, 
a polypeptide, an amino acid, a peptide, a sugar, a polynucleotide, a natural or 
synthetic polymer, a secondary metabolite or compound (biotin, fluorescein, 
retinol, digoxigenin, metal ions, phenylmethylsulfonylfluoride), or a combination 
thereof, or a chemically reactive group thereof. Multiple first attachment sites are 
present on the surface of the non-natural molecular scaffold in a repetitive 
configuration. 

[0070] Attachment Site, Second: As used herein, the phrase "second attachment 

site" refers to an element associated with the antigen or antigenic determinant to 
which the first attachment site of the "organizer" located on the surface of the 
non-natural molecular scaffold may associate. The second attachment site of the 
antigen or antigenic determinant may be a protein, a polypeptide, a peptide, a 
sugar, a polynucleotide, a natural or synthetic polymer, a secondary metabolite or 
compound (biotin, fluorescein, retinol, digoxigenin, metal ions, 
phenylmethylsulfonylfluoride), or a combination thereof, or a chemically reactive 
group thereof. At least one second attachment site is present on the antigen or 
antigenic determinant. 

[0071] Core particle: As used herein, the term "core particle" refers to a rigid 

structure with an inherent repetitive organization that provides a foundation for 
attachment of an "organizer". A core particle as used herein may be the product 
of a synthetic process or the product of a biological process. 

[0072] In certain embodiments of the invention, the antigens or antigenic 

determinants are directly linked to the core particle. 
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[0073] Cis-acting: As used herein, the phrase "czs-acting" sequence refers to 

nucleic acid sequences to which a replicase binds to catalyze the RNA-dependent 
replication of RNA molecules. These replication events result in the replication 
of the full-length and partial RNA molecules and, thus, the alpahvirus subgenomic 
promoter is also a "czs-acting" sequence. C/s-acting sequences may be located at 
or near the 5' end, 3 f end, or both ends of a nucleic acid molecule, as well as 
internally. 

[0074] Fusion: As used herein, the term "fusion" refers to the combination of 

amino acid sequences of different origin in one polypeptide chain by in-frame 
combination of their coding nucleotide sequences. The term "fusion" explicitly 
encompasses internal fusions, i.e., insertion of sequences of different origin within 
a polypeptide chain, in addition to fusion to one of its termini. 

[0075] Heterologous sequence: As used herein, the term "heterologous sequence" 

refers to a second nucleotide sequence present in a vector of the invention. The 
term "heterologous sequence" also refers to any amino acid or RNA sequence 
encoded by a heterologous DNA sequence contained in a vector of the invention. 
Heterologous nucleotide sequences can encode proteins or RNA molecules 
normally expressed in the cell type in which they are present or molecules not 
normally expressed therein (e.g., Sindbis structural proteins). 

[0076] Isolated: As used herein, when the term "isolated" is used in reference to 

a molecule, the term means that the molecule has been removed from its native 
environment. For example, a polynucleotide or a polypeptide naturally present in 
a living animal is not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is "isolated." Further, 
recombinant DNA molecules contained in a vector are considered isolated for the 
purposes of the present invention. Isolated RNA molecules include in vivo or in 
vitro RNA replication products of DNA and RNA molecules. Isolated nucleic 
acid molecules further include synthetically produced molecules. Additionally, 
vector molecules contained in recombinant host cells are also isolated. Thus, not 
all "isolated" molecules need be "purified." 
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[0077] Immunotherapeutic: As used herein, the term "immunotherapeutic" is a 

composition for the treatment of diseases or disorders. More specifically, the term 
is used to refer to a method of treatment for allergies or a method of treatment for 
cancer. 

[0078] Individual: As used herein, the term "individual" refers to multicellular 

organisms and includes both plants and animals. Preferred multicellular organisms 
are animals, more preferred are vertebrates, even more preferred are mammals, 
and most preferred are humans. 

[0079] Low or undetectable: As used herein, the phrase "low or undetectable," 

when used in reference to gene expression level, refers to a level of expression 
which is either significantly lower than that seen when the gene is maximally 
induced {e.g. , at least five fold lower) or is not readily detectable by the methods 
used in the following examples section. 

[0080] Lectin: As used herein, proteins obtained particularly from the seeds of 

leguminous plants, but also from many other plant and animal sources, that have 
binding sites for specific mono- or oligosaccharides. Examples include 
concanavalin A and wheat-germ agglutinin, which are widely used as analytical 
and preparative agents in the study of glycoprotein. 

[0081] Natural origin: As used herein, the term "natural origin" means that the 

whole or parts thereof are not synthetic and exist or are produced in nature. 

[0082] Non-natural: As used herein, the term generally means not from nature, 

more specifically, the term means from the hand of man. 

[0083] Non-natural origin: As used herein, the term "non-natural origin" generally 

means synthetic or not from nature; more specifically, the term means from the 
hand of man. 

[0084] Non-natural molecular scaffold: As used herein, the phrase "non-natural 

molecular scaffold" refers to any product made by the hand of man that may serve 
to provide a rigid and repetitive array of first attachment sites. Ideally but not 
necessarily, these first attachment sites are in a geometric order. The non-natural 
molecular scaffold may be organic or non-organic and may be synthesized 
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chemically or through a biological process, in part or in whole. The non-natural 
molecular scaffold is comprised of: (a) a core particle, either of natural or non- 
natural origin; and (b) an organizer, which itself comprises at least one first 
attachment site and is connected to a core particle by at least one covalent bond. 
In a particular embodiment, the non-natural molecular scaffold may be a virus, 
virus-like particle, a bacterial pilus, a virus capsid particle, a phage, a recombinant 
form thereof, or synthetic particle. 

[0085] Ordered and repetitive antigen or antigenic determinant array: As used 

herein, the term "ordered and repetitive antigen or antigenic determinant array" 
generally refers to a repeating pattern of antigen or antigenic determinant, 
characterized by a uniform spacial arrangement of the antigens or antigenic 
determinants with respect to the non-natural molecular scaffold. In one 
embodiment of the invention, the repeating pattern may be a geometric pattern. 
Examples of suitable ordered and repetitive antigen or antigenic determinant 
arrays are those which possess strictly repetitive paracrystalline orders of antigens 
or antigenic determinants with spacings of 5 to 15 nanometers. 

[0086] Organizer: As used herein, the term "organizer" is used to refer to an 

element bound to a core particle in a non-random fashion that provides a 
nucleation site for creating an ordered and repetitive antigen array. An organizer 
is any element comprising at least one first attachment site that is bound to a core 
particle by at least one covalent bond. An organizer may be a protein, a 
polypeptide, a peptide, an amino acid {i.e., a residue of a protein, a polypeptide 
or peptide), a sugar, a polynucleotide, a natural or synthetic polymer, a secondary 
metabolite or compound (biotin, fluorescein, retinol, digoxigenin, metal ions, 
phenylmethylsulfonylfluoride), or a combination thereof, or a chemically reactive 
group thereof. 

[0087] Permissive temperature: As used herein, the phrase "permissive 

temperature" refers to temperatures at which an enzyme has relatively high levels 
of catalytic activity. 
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[0088] Pili: As used herein, the term "pili" (singular being "pilus") refers to 

extracellular structures of bacterial cells composed of protein monomers (e.g., 
pilin monomers) which are organized into ordered and repetitive patterns. 
Further, pili are structures which are involved in processes such as the attachment 
of bacterial cells to host cell surface receptors, inter-cellular genetic exchanges, 
and cell-cell recognition. Examples of pili include Type-1 pili, P-pili 5 F1C pili, 
S-pili, and 98 7P -pili. Additional examples of pili are set out below. 

[0089] Pilus-like structure: As used herein, the phrase "pilus-like structure" refers 

to structures having characteristics similar to that of pili and composed of protein 
monomers. One example of a "pilus-like structure" is a structure formed by a 
bacterial cell which expresses modified pilin proteins that do not form ordered and 
repetitive arrays that are essentially identical to those of natural piliA 

[0090] Purified: As used herein, when the term "purified" is used in reference to 

a molecule, it means that the concentration of the molecule being purified has been 
increased relative to molecules associated with it in its natural environment. 
Naturally associated molecules include proteins, nucleic acids, lipids and sugars 
but generally do not include water, buffers, and reagents added to maintain the 
integrity or facilitate the purification of the molecule being purified. For example, 
even if mRNA is diluted with an aqueous solvent during oligo dT column 
chromatography, mRNA molecules are purified by this chromatography if 
naturally associated nucleic acids and other biological molecules do not bind to the 
column and are separated from the subject mRNA molecules. 

[0091] Receptor: As used herein, the term "receptor" refers to proteins or 

glycoproteins or fragments thereof capable of interacting with another molecule, 
called the ligand. The ligand may belong to any class of biochemical or chemical 
compounds. The receptor need not necessarily be a membrane-bound protein. 
Soluble protein, like e.g., maltose binding protein or retinol binding protein are 
receptors as well. 

[0092] Residue: As used herein, the term "residue" is meant to mean a specific 

amino acid in a polypeptide backbone or side chain. 



WO 01/85208 



PCT/IB01/00741 



-22- 



[0093] Temperature-sensitive: As used herein, the phrase "temperature-sensitive" 

refers to an enzyme which readily catalyzes a reaction at one temperature but 
catalyzes the same reaction slowly or not at all at another temperature. An 
example of a temperature-sensitive enzyme is the replicase protein encoded by the 
pCYTts vector, which has readily detectable replicase activity at temperatures 
below 34 °C and has low or undetectable activity at 37 °C. 

[0094] Transcription: As used herein, the term "transcription" refers to the 

production of KNA molecules from DNA templates catalyzed by KNA 
polymerase. 

[0095] Recombinant host cell: As used herein, the term "recombinant host cell" 

refers to a host cell into which one ore more nucleic acid molecules of the 
invention have been introduced. 

[0096] Recombinant virus: As used herein, the phrase "recombinant virus" refers 

to a virus that is genetically modified by the hand of man. The phrase covers any 
virus known in the art. More specifically, the phrase refers to a an alphavirus 
genetically modified by the hand of man, and most specifically, the phrase refers 
to a Sinbis virus genetically modified by the hand of man. 

[0097] Restrictive temperature: As used herein, the phrase "restrictive 

temperature" refers to temperatures at which an enzyme has low or undetectable 
levels of catalytic activity. Both "hot" and "cold" sensitive mutants are known 
and, thus, a restrictive temperature may be higher or lower than a permissive 
temperature. 

[0098] RNA-dependent RNA replication event: As used herein, the phrase 

"RNA-dependent RNA replication event" refers to processes which result in the 
formation of an RNA molecule using an RNA molecule as a templateA 

[0099] RNA-Dependent RNA polymerase: As used herein, the phrase "RNA- 

Dependent RNA polymerase" refers to a polymerase which catalyzes the 
production of an RNA molecule from another RNA molecule. This term is used 
herein synonymously with the term "replicase." 
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[0100] Untranslated RNA: As used herein, the phrase ''untranslated RNA" refers 

to an RNA sequence or molecule which does not encode an open reading frame 
or encodes an open reading frame, or portion thereof, but in a format in which an 
amino acid sequence will not be produced (e.g., no initiation codon is present). 
Examples of such molecules are tRNA molecules, rRNA molecules, and 
ribozymes. 

[0101] Vector: As used herein, the term "vector" refers to an agent (e.g., a 

plasmid or virus) used to transmit genetic material to a host cell. A vector may 
be composed of either DNA or RNA. 

[0102] one, a, or an: When the terms "one," "a," or "an" are used in this 

disclosure, they mean "at least one" or "one or more," unless otherwise indicated. 

2. Compositions of Ordered and Repetitive Antigen or Antigenic 
Determinant Arrays and Methods to Make the Same 

[0103] The disclosed invention provides compositions comprising an ordered and 

repetitive antigen or antigenic determinant. Furthermore, the invention 
conveniently enables the practitioner to construct ordered and repetitive antigen 
or antigenic determinant arrays for various treatment purposes, which includes the 
prevention of infectious diseases, the treatment of allergies and the treatment of 
cancers. The invention also enables the practitioner to construct compositions 
comprising Pili inducing Th2 immune responses, useful in the treatment of chronic 
diseases. 

[0104] Compositions of the invention essentially comprise, or alternatively consist 

of, two elements: (1) a non-natural molecular scaffold; and (2) an antigen or 
antigenic determinant with at least one second attachment site capable of 

r 

association through at least one non-peptide bond to said first attachment site. 
[0105] The non-natural molecular scaffold comprises, or alternatively consists of: 

(a) a core particle selected from the group consisting of (1) a core particle of non- 
natural origin and (2) a core particle of natural origin; and (b) an organizer 
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comprising at least one first attachment site, wherein said organizer is connected 
to said core particle by at least one covalent bond. 

[0106] Compositions of the invention also comprise, or alternatively consist of 5 

core particles to which antigens or antigenic determinants are directly linked. 

[0107] The antigen or antigenic determinant has at least one second attachment 

site which is selected from the group consisting of (a) an attachment site not 
naturally occurring with said antigen or antigenic determinant; and (b) an 
attachment site naturally occurring with said antigen or antigenic determinant. 

[0108] The inventicm provides for an ordered and repetitive antigen array through 

an association of the second attachment site to the first attachment site by way of 
at least one non-peptide bond. Thus, the antigen or antigenic determinant and the 
non-natural molecular scaffold are brought together through this association of the 
first and the second attachment site to form an ordered and repetitive antigen 
array. 

[0109] The practioner may specifically design the antigen or antigenic determinant 

and the second attachment site such that the arrangement of all the antigens or 
antigenic determinants bound to the non-natural molecular scaffold or, in certain 
embodiments, the core particle will be uniform. For example, one may place a 
single second attachment site on the antigen or antigenic determinant at the 
carboxyl or amino terminus, thereby ensuring through design that all antigen or 
antigenic determinant molecules that are attached to the non-natural molecular 
scaffold are positioned in a uniform way. Thus, the invention provides a 
convenient means of placing any antigen or antigenic determinant onto a 
non-natural molecular scaffold in a defined order and in a manner which forms a 
repetitive pattern. 

[0110] As will be clear to those skilled in the art, certain embodiments of the 

invention involve the use of recombinant nucleic acid technologies such as cloning, 
polymerase chain reaction, the purification of DNA and UNA, the expression of 
recombinant proteins in prokaryotic and eukaryotic cells, etc. Such 
methodologies are well known to those skilled in the art and may be conveniently 
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found in published laboratory methods manuals (e.g., Sambrook, J. et aL, eds., 
Molecular Cloning, A Laboratory Manual, 2nd. edition, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N. Y. (1989); Ausubel, F. et a/., 
eds., Current Protocols nsrMoLECULARBiOLOGY, JohnH. Wiley & Sons, Inc. 
(1997)). Fundamental laboratory techniques for working with tissue culture cell 
lines (Celis, J., ed., CELL BIOLOGY, Academic Press, 2 nd edition, (1998)) and 
antibody-based technologies (Harlow, E, and Lane, D., "Antibodies: ALaboratory 
Manual," Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1988); 
Deutscher,M.P., "Guide to Protein Purification, " Meih. Enzymol 128, Academic 
Press San Diego (1990); Scopes, R.K., "Protein Purification Principles and 
Practice," 3 rd ed., Springer- Verlag, New York (1994)) are also adequately 
described in the literature, all of which are incorporated herein by reference. 

A. Construction of a Non-Natural Molecular Scaffold 

[0111] One element in compositions of the invention is a non-natural molecular 

scaffold comprising, or alternatively consisting of, a core particle and an organizer. 
As used herein, the phrase "non-natural molecular scaffold" refers to any product 
made by the hand of man that may serve to provide a rigid and repetitive array of 
first attachment sites. More specifically, the non-natural molecular scaffold 
comprises, or alternatively consists of, (a) a core particle selected from the group 
consisting of (1) a core particle of non-natural origin and (2) a core particle of 
natural origin; and (b) an organizer comprising at least one first attachment site, 
wherein said organizer is connected to said core particle by at least one covalent 
• bond. 

[0112] As will be readily apparent to those skilled in the art, the core particle of 

the non-natural molecular scaffold of the invention is not limited to any specific 
form. The core particle may be organic or non-organic and may be synthesized 
chemically or through a biological process. 
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[0113] In one embodiment, a non-natural core particle may be a synthetic 

polymer, a lipid micelle or a metal Such core particles are known in the art, 
providing a basis from which to build the novel non-natural molecular scaffold of 
the invention. By way of example, synthetic polymer or metal core particles are 
described in U.S. Patent No. 5,770,380, which discloses the use of a calixarene 
organic scaffold to which is attached a plurality of peptide loops in the creation of 
an 'antibody mimic 3 , and U.S. Patent No. 5,334,394 describes nanocrystalline 
particles used as a viral decoy that are composed of a wide variety of inorganic 
materials, including metals or ceramics. Suitable metals include chromium, 
rubidium, iron, zinc, selenium, nickel, gold, silver, platinum. Suitable ceramic 
materials in this embodiment include silicon dioxide, titanium dioxide, aluminum 
oxide, ruthenium oxide and tin oxide. The core particles of this embodiment may 
be made from organic materials including carbon (diamond). Suitable polymers 
include polystyrene, nylon and nitrocellulose. For this type of nanocrystalline 
particle, particles made from tin oxide, titanium dioxide or carbon (diamond) are 
may also be used. A lipid micelle may be prepared by any means known in the art. 
For example micelles may be prepared according to the procedure of Baiselle and 
Millar (Biophys. Chem. 4:355-361 (1975)) or Corti et al (Chem. Phys. Lipids 
35:197-214 (1981)) or Lopez et al (FEBS Lett 426:314-318 (1998)) or 
Topchieva and Karezin (J. Colloid Interface Set 213:29-35 (1999)) or Morein et 
al, (Nature 305:457-460 (1984)), which are all incorporated herein by reference. 

[0114] The core particle may also be produced through a biological process, 

which may be natural or non-natural. By way of example, this type of 
embodiment may includes a core particle comprising, or alternatively consisting 
of, a virus, virus-like particle, a bacterial pilus, a phage, a viral capsid particle or 
a recombinant form thereof. In a more specific embodiment, the core particle may 
comprise, or alternatively consist of, recombinant proteins of Rotavirus, 
recombinant proteins of Norwalk virus, recombinant proteins of Alphavirus, 
recombinant proteins which form bacterial pili or pilus-like structures, 
recombinant proteins of Foot and Mouth Disease virus, recombinant proteins of 
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Retrovirus, recombinant proteins of Hepatitis B virus (e.g., a HBcAg), 
recombinant proteins of Tobacco mosaic virus, recombinant proteins of Flock 
House Virus, and recombinant proteins of human Papilomavirus. 

[0115] Whether natural or non-natural, the core particle of the invention will 

generally have an organizer that is attached to the natural or non-natural core 
particle by at least one covalent bond. The organizer is an element bound to a 
core particle in a non-random fashion that provides a nucleation site for creating 
an ordered and repetitive antigen array. Ideally, but not necessarily, the organizer 
is associated with the core particle in a geometric order. Minimally, the organizer 
comprises a first attachment site. 

[0116] In some embodiments of the invention, the ordered and repetitive array is 

formed by association between (1) either core particles or non-natural molecular 
scaffolds and (2) an antigen or antigenic determinant. For example, bacterial pili 
or pilus-like structures are formed from proteins which are organized into ordered 
and repetitive structures. Thus, in many instances, it will be possible to form 
ordered arrays of antigens or antigenic determinants by linking these constituents 
to bacterial pili or pili-like structures. 

[0117] As previously stated, the organizer may be any element comprising at least 

one first attachment site that is bound to a core particle by at least one covalent 
bond. An organizer may be a protein, a polypeptide, a peptide, an amino acid 
(i.e., a residue of a protein, a polypeptide or peptide), a sugar, a polynucleotide, 
a natural or synthetic polymer, a secondary metabolite or compound (biotin, 
fluorescein, retinol, digoxigenin, metal ions, phenylmethylsulfonylfluoride), or a 
combination thereof, or a chemically reactive group thereof. In a more specific 
embodiment, the organizer may comprise a first attachment site comprising an 
antigen, an antibody or antibody fragment, biotin, avidin, strepavidin, a receptor, 
a receptor ligand, a ligand, a ligand-binding protein, an interacting leucine zipper 
polypeptide, an amino group, a chemical group reactive to an amino group; a 
carboxyl group, chemical group reactive to a carboxyl group, a sulfhydryl group, 
a chemical group reactive to a sulfhydryl group, or a combination thereof 
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[0118] In one embodiment, the core particle of the non-natural molecular scaffold 

comprises a virus, a bacterial pilus, a structure formed from bacterial pilin, a 
bacteriophage, a virus-like particle, a viral capsid particle or a recombinant form 
thereof Any virus known in the art having an ordered and repetitive coat and/or 
core protein structure may be selected as a non-natural molecular scaffold of the 
invention; examples of suitable viruses include: sindbis and other alphaviruses; 
vesicular somatitis virus; rhabdo-, (e.g. vesicular stomatitis virus), picorna-, toga-, 
orthomyxo-, polyoma-, parvovirus, rotavirus, Norwalk virus, foot and mouth 
disease virus, a retrovirus, Hepatitis B virus, Tobacco mosaic virus, flock house 
virus, human papilomavirus (for example, see Table 1 in Bachman, M.F. and 
Zinkernagel, R.M., Immunol. Today 77:553-558 (1996)). 

[0119] In one embodiment, the invention utilizes genetic engineering of a virus to 

create a fusion between an ordered and repetitive viral envelope protein and an 
organizer comprising a heterologous protein, peptide, antigenic determinant or a 
reactive amino acid residue of choice. Other genetic manipulations known to 
those in the art may be included in the construction of the non-natural molecular 
scaffold; for example, it may be desirable to restrict the replication ability of the 
recombinant virus through genetic mutation. The viral protein selected for fusion 
to the organizer (i.e., first attachment site) protein should have an organized and 
repetitive structure. Such an organized and repetitive structure include 
paracrystalline organizations with a spacing of 5 - 1 5 nm on the surface of the virus. 
The creation of this type of fusion protein will result in multiple, ordered and 
repetitive organizers on the surface of the virus. Thus, the ordered and repetitive 
organization of the first attachment sites resulting therefrom will reflect the normal 
organization of the native viral protein. 

[0120] As will be discussed in more detail herein, in another embodiment of the 

invention, the non-natural molecular scaffold is a recombinant alphavirus, and 
more specifically, a recombinant Sinbis virus. Alphaviruses are positive stranded 
KNA viruses that replicate their genomic RNA entirely in the cytoplasm of the 
infected cell and without a DNA intermediate (Strauss, J. and Strauss, E., 
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Microbiol Rev. 55:491-562 (1994)). Several members of the alphavirus family, 
Sindbis (Xiong, C. etal, Science 243: 1 188-1 191 (1989); Schlesinger, S., Trends 
Biotechnol 77:18-22 (1993)), Semliki Forest Virus (SFV) (Liljestrom, P. & 
Garoff, H., Bio/Technology 9\ 1356-1361 (1991)) and others (Davis, NX. et al, 
Virology 777:189-204 (1989)), have received considerable attention for use as 
virus-based expression vectors for a variety of different proteins (Lundstrom, K., 
Curr. Opin. Biotechnol 5:578-582 (1997); Liljestrom, P., Curr. Opin. 
Biotechnol 5:495-500 (1994)) and as candidates for vaccine development. 
Recently, a number of patents have issued directed to the use of alphaviruses for 
the expression of heterologous proteins and the development of vaccines (see U. S . 
Patent Nos. 5,766,602; 5,792,462; 5,739,026; 5,789,245 and 5,814,482). The 
construction of the alphaviral scaffold of the invention may be done by means 
generally known in the art of recombinant DNA technology, as described by the 
aforementioned articles, which are incorporated herein by reference. 

[0121] A variety of different recombinant host cells can be utilized to produce a 

viral-based core particle for antigen or antigenic determinant attachment. For 
example, Alphaviruses are known to have a wide host range; Sindbis virus infects 
cultured mammalian, reptilian, and amphibian cells, as well as some insect cells 
(Clark, H., J. Natl Cancer Inst 57:645 (1973); Leake, C, J. Gen. Virol 35:335 
(1977); Stollar, V. in THE TOGA VIRUSES, R.W. Schlesinger, Ed., Academic Press, 
(1980), pp. 583-621). Thus, numerous recombinant host cells can be used in the 
practice of the invention. BHK, COS, Vero, HeLa and CHO cells are particularly 
suitable for the production of heterologous proteins because they have the 
potential to glycosylate heterologous proteins in a manner similar to human cells 
(Watson, E. etal, Glycobiology 4:227 \ (1994)) and can be selected (Zang, M. et 
al, Bio/Technology 75:389 (1995)) or genetically engineered (Renner W. et al, 
Biotech. Bioeng. 4:476 (1995); Lee K. etal Biotech. Bioeng. 50:336 (1996)) to 
grow in serum-free medium, as well as in suspension. 

[0122] Introduction of the polynucleotide vectors into host cells can be effected 

by methods described in standard laboratory manuals (see f e.g., Sambrook, J. et 
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a/., eds., Molecular Cloning, A Laboratory Manual, 2nd. edition, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y. (1989), Chapter 9; 
Ausubel, F. etaL, eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, JohnH. 
Wiley & Sons, Inc. (1997), Chapter 16), including methods such as 
electr op oration, DEAE-dextran mediated transfection, transfection, 
microinjection, cationic lipid-mediated transfection, transduction, scrape loading, 
. ballistic introduction, and infection. Methods for the introduction of exogenous 
DNA sequences into host cells are discussed in Feigner, P. etaL, U.S. Patent No. 
5,580,859. 

[0123] Packaged RNA sequences can also be used to infect host cells. These 

packaged RNA sequences can be introduced to host cells by adding them to the 
culture medium. For example, the preparation of non-infective alpahviral particles 
is described in a number of sources, including "Sindbis Expression System", 
Version C (Invitrogen Catalog No. K750-1). 

[0124] When mammalian cells are used as recombinant host cells for the 

production of viral-based core particles, these cells will generally be grown in 
tissue culture. Methods for growing cells in culture are well known in the art (see, 
e.g., Celis, J., ed., CELLBlOLOGY, Academic Press, 2 nd edition, (1998); Sambrook, 
J. et aL, eds., MOLECULAR CLONING, A LABORATORY MANUAL, 2nd. edition, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); Ausubel, 
F. etaL, eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John H. Wiley & 
Sons, Inc. (1997); Freshney, R., CULTURE OF ANIMAL CELLS, Alan R. Liss, Inc. 
(1983)). 

[0125] As will be understood by those in the art, the first attachment site may be 

or be a part of any suitable protein, polypeptide, sugar, polynucleotide, peptide 
(amino acid), natural or synthetic polymer, a secondary metabolite or combination 
thereof that may serve to specifically attach the antigen or antigenic determinant 
of choice to the non-natural molecular scaffold. In one embodiment, the 
attachment site is a protein or peptide that may be selected from those known in 
the art. For example, the first attachment site may selected from the following 
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group: a ligand, a receptor, a lectin, avidin, streptavidin, biotin, an epitope such 
as an HA or T7 tag, Myc 5 Max, immunoglobulin domains and any other amino 
acid sequence known in the art that would be useful as a first attachment site. 

[0126] It should be further understood by those in the art that with another 

embodiment of the invention, the first attachment site may be created secondarily 
to the organizer (i.e., protein or polypeptide) utilized in constructing the in-frame 
fusion to the capsid protein. For example, a protein may be utilized for fusion to 
the envelope protein with an amino acid sequence known to be glycosylated in a 
specific fashion, and the sugar moiety added as a result may then serve at the first 
attachment site of the viral scaffold by way of binding to a lectin serving as the 
secondary attachment site of an antigen. Alternatively, the organizer sequence 
may be biotinylated in vivo and the biotin moiety may serve as the first attachment 
site of the invention, or the organizer sequence may be subjected to chemical 
modification of distinct amino acid residues in vitro, the modification serving as 
the first attachment site. 

[0127] One specific embodiment of the invention utilizes the Sinbis virus. The 

Sinbis virus RNA genome is packaged into a capsid protein that is surrounded by 
a lipid bilayer containing three proteins called El, E2, and E3. These so-called 
envelope proteins are glycoproteins, and the glycosylated portions are located on 
the outside of the lipid bilayer, where complexes of these proteins form the 
"spikes" that can be seen in electron micrographs to project outward from the 
surface of the virus. In another embodiment of the invention, the first attachment 
site is selected to be the JUN or FOS leucine zipper protein domain that is fused 
in frame to the E2 envelope protein. However, it will be clear to all individuals 
in the art that other envelope proteins may be utilized in the fusion protein 
construct for locating the first attachment site in the non-natural molecular 
scaffold of the invention. 

[0128] In a specific embodiment of the invention, the first attachment site is 

selected to be the JUN-FOS leucine zipper protein domain that is fused in frame 
to the Hepatitis B capsid (core) protein (HBcAg). However, it will be clear to all 
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individuals in the art that other viral capsid proteins may be utilized in the fusion 
protein construct for locating the first attachment site in the non-natural molecular 
scaffold of the invention. 

[0129] In another specific embodiment of the invention, the first attachment site 

is selected to be a lysine or cysteine residue that is fused in frame to the HBcAg. 
However, it will be clear to all individuals in the art that other viral capsid or 
virus-like particles may be utilized in the fusion protein construct for locating the 
first attachment in the non-natural molecular scaffold of the invention. 

[0130] Example 1 is provided to demonstrate the construction of an in-frame 

fusion protein between the Sinbis virus E2 envelope protein and the JUN leucine 
zipper protein domain using the pTE5'2J vector of Hahn et al (Proc. Natl. Acad. 
Set USA 89: 2679-2683 (1992)). The JUN amino acid sequence utilized for the 
first attachment site is the following: CGGRIARLEEKVKTLKAQNSE 
LASTANMLREQVAQLKQKVMNHVGC (SEQ ID NO: 59). In this instance, 
the anticipated second attachment site on the antigen would be the FOS leucine 
zipper protein domain and the amino acid sequence would be the following: 
CGGLTDTLQ AETDQ VEDEKS ALQTEIANLLKEKEKLEFILAAHGGC (SEQ 
IDNO:60) 

[0131] These sequences are derived from the transcription factors JUN and FOS, 

each flanked with a short sequence containing a cysteine residue on both sides. 
These sequences are known to interact with each other. The original hypothetical 
structure proposed for the JUN-FOS dimer assumed that the hydrophobic side 
chains of one monomer interdigitate with the respective side chains of the other 
monomer in a zipper-like manner (Landschulz et al, Science 240:1759-1764 

(1988) ). However, this hypothesis proved to be wrong, and these proteins are 
known to form an a-helical coiled coil (O'Shea et al, Science 243:538-542 

(1989) ; O'Shea et al, Cell 65:699-708 (1992); Cohen & Parry, Trends Biochem. 
Set 77:245-248 (1986)). Thus, the term "leucine zipper' 1 is frequently used to 
refer to these protein domains for more historical than structural reasons. 
Throughout this patent, the term "leucine zipper" is used to refer to the sequences 
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depicted above or sequences essentially similar to the ones depicted above. The 
terms JUN and FOS are used for the respective leucine zipper domains rather than 
the entire JUN and FOS proteins. 

[0132] In one embodiment, the invention provides for the production of a Sinbis 

virus B2-JUN scaffold using the pCYTts expression system (WO 99/50432). The 
pC YTts expression system provides novel expression vectors which permit tight 
regulation of gene expression in eucaryotic cells. The DNA vectors of this system 
are transcribed to form RNA molecules which are then replicated by a 
temperature-sensitive replicase to form additional RNA molecules. The RNA 
molecules produced by replication contain a nucleotide sequence which may be 
translated to produce a protein of interest or which encode one or more 
untranslated RNA molecules. Thus the expression system enables the production 
of recombinant Sinbis virus particles. 

[0133] Example 2 provides details on the production of the E2-JUN Sinbis non- 

natural molecular scaffold of the invention. Additionally provided in Example 3 
is another method for the production of recombinant E2-/6WSinbis virus scaffold 
using the pTE5 / 2JE2:JLW vector produced in Example 1. Thus the invention 
provides two means, the pCYTts expression system (Example 2) and the pTE5' 2 J 
vector system (Example 3) by which recombinant Sinbis virus E2-JUN non-natural 
molecular scaffold may be produced. An analysis of viral particles produced in 
each system is provided in Figure 1 and Figure 2. 

[0134] As previously stated, the invention includes viral-based core particles 

which comprise, or alternatively consist of, a virus, virus-like particle, a phage, a 
viral capsid particle or a recombinant form thereof. Skilled artisans have the 
knowledge to produce such core particles and attach organizers thereto. By way 
of providing other examples, the invention provides herein for the production of 
Hepatitis B virus-like particles and measles viral capsid particles as core particles 
(Examples 17 to 22). In such an embodiment, the JUN leucine zipper protein 
domain or FOS leucine zipper protein domain may be used as an organizer, and 
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hence as a first attachment site, for the non-natural molecular scaffold of the 
invention. 

[0135] Examples 23-29 provide details of the production of Hepatitis B core 

particles carrying an in-frame fused peptide with a reactive lysine residue and 
antigens carrying a genetically fused cysteine residue, as first and second 
attachment site, respectively. 

[0136] In other embodiments, the core particles used in compositions of the 

invention are composed of a Hepatitis B capsid (core) protein (HBcAg), or 
fragment thereof, which has been modified to either eliminate or reduce the 
number of free cysteine residues. Zhou et al. (J. Virol 66:5393-5398 (1992)) 
demonstrated that HBcAgs which have been modified to remove the naturally 
resident cysteine residues retain the ability to associate and form multimeric 
structures. Thus, core particles suitable for use in compositions of the invention 
include those comprising modified HBcAgs, or fragments thereof, in which one 
or more of the naturally resident cysteine residues have been either deleted or 
substituted with another amino acid residue (e.g., a serine residue). 

[0137] The HBcAg is a protein generated by the processing of a Hepatitis B core 

antigen precursor protein. A number of isotypes of the HBcAg have been 
identified. For example, the HBcAg protein having the amino acid sequence 
shown in SEQ ID NO: 132 is 183 amino acids in length and is generated by the 
processing of a 212 amino acid Hepatitis B core antigen precursor protein. This 
processing results in the removal of 29 amino acids from the N-terminus of the 
Hepatitis B core antigen precursor protein. Similarly, the HBcAg protein having 
the amino acid sequence shown in SEQ ID NO: 134 is 185 amino acids in length 
and is generated by the processing of a 214 amino acid Hepatitis B core antigen 
precursor protein. The amino acid sequence shown in SEQ ID NO: 134, as 
compared to the amino acid sequence shown in SEQ ID NO: 132, contains a two 
amino acid insert at positions 152 and 153 in SEQ ID NO: 134. 

[0138] In most instances, vaccine compositions of the invention will be prepared 

using the processed form of a HBcAg (Le. , a HBcAg from which the N-terminal 
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leader sequence (e.g. , the first 29 amino acid residues shown in SEQ ID NO: 134) 
of the Hepatitis B core antigen precursor protein have been removed). 

[0139] Further, when HBcAgs are produced under conditions where processing 

will not occur, the HBcAgs will generally be expressed in "processed" form. For 
example, bacterial systems, such as E. coli, generally do not remove the leader 
sequences of proteins which are normally expressed in eukaryotic cells. Thus, 
when an E. coli expression system is used to produce HBcAgs of the invention, 
these proteins will generally be expressed such that the N-t erminal leader sequence 
of the Hepatitis B core antigen precursor protein is not present. 

[0140] In one embodiment of the invention, a modified HBcAg comprising the 

amino acid sequence shown in SEQ ID NO: 134, or subportion thereof, is used to 
prepare non-natural molecular scaffolds. In particular, modified HBcAgs suitable 
for use in the practice of the invention include proteins in which one or more of 
the cysteine residues at positions corresponding to positions 48, 61, 107 and 185 
of a protein having the amino acid sequence shown in SEQ ID NO: 134 have been 
either deleted or substituted with other amino acid residues (e.g. , a serine residue). 
As one skilled in the art would recognize, cysteine residues at similar locations in 
HBcAg variants having amino acids sequences which differ from that shown in 
SEQ ID NO: 134 could also be deleted or substituted with other amino acid 
residues. The modified HBcAg variants can then be used to prepare vaccine 
compositions of the invention. 

[0141] The present invention also includes HBcAg variants which have been 

modified to delete or substitute one or more additional cysteine residues which are 
not found in polypeptides having the amino acid sequence shown in SEQ ID 
NO: 134. Examples of such HBcAg variants have the amino acid sequences 
shown in SEQ ID NOs:90 and 132. These variant contain cysteines residues at 
a position corresponding to amino acid residue 147 in SEQ ID NO: 134. Thus, the 
vaccine compositions of the invention include compositions comprising HBcAgs 
in which cysteine residues not present in the amino acid sequence shown in SEQ 
ID NO: 134 have been deleted. 
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[0 1 42] Under certain circumstances (e.g. , when a heterobiflxnctional cross-linking 

reagent is used to attach antigens or antigenic determinants to the non-natural 
molecular scaffold), the presence of free cysteine residues in the HBcAg is 
believed to lead to covalent coupling of toxic components to core particles, as 
well as the cross-linking of monomers to form undefined species. 

[0143] Further, in many instances, these toxic components may not be detectable 

with assays performed on compositions of the invention. This is so because 
covalent coupling of toxic components to the non-natural molecular scaffold 
would result in the formation of a population of diverse species in which toxic 
components are linked to different cysteine residues, or in some cases no cysteine 
residues, of the HBcAgs. In other words, each free cysteine residue of each 
HBcAg will not be covalently linked to toxic components. Further, in many 
instances, none of the cysteine residues of particular HBcAgs will be linked to 
toxic components. Thus, the presence of these toxic components may be difficult 
to detect because they would be present in a mixed population of molecules. The 
administration to an individual of HBcAg species containing toxic components, 
however, could lead to a potentially serious adverse reaction. 

[0144] It is well known in the art that free cysteine residues can be involved in a 

number of chemical side reactions. These side reactions include disulfide 
exchanges, reaction with chemical substances or metabolites that are, for example, 
injected or formed in a combination therapy with other substances, or direct 
oxidation and reaction with nucleotides upon exposure to UV light. Toxic 
adducts could thus be generated, especially considering the fact that HBcAgs have 
a strong tendency to bind nucleic acids. Detection of such toxic products in 
antigen-capsid conjugates would be difficult using capsids prepared using HBcAgs 
containing free cysteines and heterobifunctional cross-linkers, since a distribution 
of products with a broad range of molecular weight would be generated. The 
toxic adducts would thus be distributed between a multiplicity of species, which 
individually may each be present at low concentration, but reach toxic levels when 
together. 
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[0145] In view of the above, one advantage to the use of HBcAgs in vaccine 

compositions which have been modified to remove naturally resident cysteine 
residues is that sites to which toxic species can bind when antigens or antigenic 
determinants are attached to the non-natural molecular scaffold would be reduced 
in number or eliminated altogether. Further, a high concentration of cross-linker 
can be used to produce highly decorated particles without the drawback of 
generating a plurality ofundefined cross-linked species ofHBcAg monomers (i.e. , 
a diverse mixture of cross-linked monomeric HbcAgs). 

[0146] A number of naturally occurring HBcAg variants suitable for use in the 

practice of the present invention have been identified. Yuan et al, (X Virol 
73:10122-10128 (1999)), for example, describe variants in which the isoleucine 
residue at position corresponding to position 97 in SEQ ID NO: 134 is replaced 
with either a leucine residue or a phenylalanine residue. The amino acid sequences 
of a number of HBcAg variants, as well as several Hepatitis B core antigen 
precursor variants, are disclosed in GenBank reports AAF121240 (SEQ ID 
NO:89), AF121239 (SEQ ID NO:90), X85297 (SEQ ID NO:91), X02496 (SEQ 
ID NO:92), X85305 (SEQ ID NO:93), X85303 (SEQ ID NO:94), AF151735 
(SEQ ID NO:95), X85259 (SEQ IDNO:96), X85286 (SEQ IDNO:97), X85260 
(SEQ ID NO:98), X85317 (SEQ ID NO:99), X85298 (SEQ ID NO: 100), 
AF043593 (SEQ ID NO: 101), M20706 (SEQ ID NO: 102), X85295 (SEQ ID 
NO:103), X80925 (SEQ ID NO: 104), X85284 (SEQ ID NO: 105), X85275 (SEQ 
ID NO: 106), X72702 (SEQ ID NO: 107), X85291 (SEQ ID NO: 108), X65258 
(SEQ ID NO:109), X85302 (SEQ ID NO:110), M32138 (SEQ ID NO:lll), 
X85293 (SEQ ID NO: 112), X85315 (SEQ ID NO: 113), U95551 (SEQ ID 
NO:114), X85256(SEQIDNO:115), X85316(SEQIDNO:116),X85296 (SEQ 
ID NO: 1 17), AB033559 (SEQ ID NO: 1 18), X59795 (SEQ ID NO: 119), X85299 
(SEQ ID NO:120), X85307 (SEQ ID NO:121), X65257 (SEQ ID NO: 122), 
X85311 (SEQ ID NO: 123), X85301 (SEQ ID NO: 124), X85314 (SEQ ID 
NO: 125), X85287 (SEQ ID NO: 126), X85272 (SEQ ID NO: 127), X853 19 (SEQ 
ID NO: 128), AB010289 (SEQ ID NO:129), X85285 (SEQ ID NO:130), 
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AB010289 (SEQ ID NO: 13 1), AF121242 (SEQ ID NO: 132), M90520 (SEQ ID 
NO:135),P03153 (SEQ ID NO: 136), AF1 10999 (SEQ ID NO: 137), andM95589 
(SEQ ID NO: 138), the disclosures of each of which are incorporated herein by 
reference. These HBcAg variants differ in amino acid sequence at a number of 
positions, including amino acid residues which corresponds to the amino acid 
residues located at positions 12, 13, 21, 22, 24, 29, 32, 33, 35, 38, 40, 42, 44, 45, 
49, 51, 57, 58, 59, 64, 66, 67, 69, 74, 77, 80, 81, 87, 92, 93, 97, 98, 100, 103, 
105, 106, 109, 113, 116, 121, 126, 130, 133, 135, 141, 147, 149, 157, 176, 178, 
182 and 183 in SEQ ID NO: 134. The invention is also directed to amino acid 
sequences that are at least 65, 0, 75, 80, 85, 90 or 95% identical to the above 
Hepatitis B viral capsid protein sequences. HBcAgs suitable for use in the present 
invention may be derived from any organism so long as they are able to associate 
to form an ordered and repetitive antigen array, 

[0147] As noted above, generally processed HBcAgs (/. e. , those which lack leader 

sequences) will be used in the vaccine compositions of the invention. Thus, when 
HBcAgs having amino acid sequence shown in SEQ ID NOs: 136, 137, or 138 are 
used to prepare vaccine compositions of the invention, generally 30, 35-43, or. 
35-43 amino acid residues at the N-terminus, respectively, of each of these 
proteins will be omitted. 

[0148] The present invention includes vaccine compositions, as well as methods 

for using these compositions, which employ the above described variant HBcAgs 
for the preparation of non-natural molecular scaffolds. 

[0149] Further included withing the scope of the invention are additional HBcAg 

variants which are capable of associating to form dimeric or multimeric structures. 
Thus, the invention further includes vaccine compositions comprising HBcAg 
polypeptides comprising, or alternatively consisting of, amino acid sequences 
which are at least 80%, 85%, 90%, 95%, 97%, or 99% identical to any of the 
amino acid sequences shown in SEQ ID NOs:89-132 and 134-138, and forms of 
these proteins which have been processed, where appropriate, to remove the 
N-terminal leader sequence. 
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[0150] Whether the amino acid sequence of a polypeptide has an amino acid 

sequence that is at least 80%, 85%, 90%, 95%, 97%, or 99% identical to one of 
the amino acid sequences shown in SEQ ID NOs:89-132 and 134-138, or a 
subportion thereof, can be determined conventionally using known computer 
programs such the Bestfit program. When using Bestfit or any other sequence 
alignment program to determine whether a particular sequence is, for instance, 
95% identical to a reference amino acid sequence according to the present 
invention, the parameters are set such that the percentage of identity is calculated 
over the full length of the reference amino acid sequence and that gaps in 
homology of up to 5% of the total number of amino acid residues in the reference 
sequence are allowed. 

[0151] The HBcAg variants and precursors having the amino acid sequences set 

out in SEQ ID NOs:89-132 and 134-136 are relatively similar to each other. 
Thus, reference to an amino acid residue of a HBcAg variant located at a position 
which corresponds to a particular position in SEQ ID NO: 134, refers to the amino 
acid residue which is present at that position in the amino acid sequence shown in 
SEQ ID NO: 134. The homology between these HBcAg variants is for the most 
part high enough among Hepatitis B viruses that infect mammals so that one 
skilled in the art would have little difficulty reviewing both the amino acid 
sequence shown in SEQ ID NO: 134 and that of a particular HBcAg variant and 
identifying "corresponding" amino acid residues. For example, the HBcAg amino 
acid sequence shown in SEQ ID NO: 135, which shows the amino acid sequence 
of a HBcAg derived from a virus which infect woodchucks, has enough homology 
to the HBcAg having the amino acid sequence shown in SEQ ID NO: 134 that it 
is readily apparent that a three amino acid residue insert is present in SEQ ID 
NO:135 between amino acid residues 155 and 156 of SEQ ID NO: 134. 

[0152] The HBcAgs of Hepatitis B viruses which infect snow geese and ducks 

differ enough from the amino acid sequences of HBcAgs of Hepatitis B viruses 
which infect mammals that alignment of these forms of this protein with the amino 
acid sequence shown in SEQ ID NO: 134 is difficult. However, the invention 
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includes vaccine compositions which comprise HBcAg variants of Hepatitis B 
viruses which infect birds, as wells as vaccine compositions which comprise 
fragments of these HBcAg variants. HBcAg fragments suitable for use in 
preparing vaccine compositions of the invention include compositions which 
contain polypeptide fragments comprising, or alternatively consisting of, amino 
acid residues selected from the group consisting of 36-240, 36-269, 44-240, 
44-269, 36-305, and 44-305 of SEQ ID NO: 137 or 36-240, 36-269, 44-240, 
44-269, 36-305, and 44-305 of SEQ ID NO: 138. As one skilled in the art would 
recognize, one, two, three or more of the cysteine residues naturally present in 
these polypeptides (e.g., the cysteine residues at position 153 is SEQ ID NO: 137 
or positions 34, 43, and 196 in SEQ ID NO: 138) could be either substituted with 
another amino acid residue or deleted prior to their inclusion in vaccine 
compositions of the invention. 

[0153] In one embodiment, the cysteine residues at positions 48 and 107 of a 

protein having the amino acid sequence shown in SEQ ID NO: 134 are deleted or 
substituted with another amino acid residue but the cysteine at position 61 is left 
in place. Further, the modified polypeptide is then used to prepare vaccine 
compositions of the invention. 

[0154] As set out below in Example 3 1 5 the cysteine residues at positions 48 and 

107, which are accessible to solvent, may be removed, for example, by 
site-directed mutagenesis. Further, the inventors have found that the Cys-48-Ser, 
Cys-107-Ser HBcAg double mutant constructed as described in Example 3 1 can 
be expressed in E. coli. 

[0155] As discussed above, the elimination of free cysteine residues reduces the 

number of sites where toxic components can bind to the HBcAg, and also 
eliminates sites where cross-linking of lysine and cysteine residues of the same or 
of neighboring HBcAg molecules can occur. The cysteine at position 61, which 
is involved in dimer formation and forms a disulfide bridge with the cysteine at 
position 61 of another HBcAg, will normally be left intact for stabilization of 
HBcAg dimers and multimers of the invention. 
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[0156] As shown in Example 32, cross-linking experiments performed with 

( 1 ) HBcAgs containing free cysteine residues and (2) HBcAgs whose free cysteine 
residues have been made unreactive with iodacetamide, indicate that free cysteine 
residues of the HBcAg are responsible for cross-linking between HBcAgs through 
reactions between heterobifunctional cross-linker derivatized lysine side chains, 
and free cysteine residues. Example 32 also indicates that cross-linking of HBcAg 
subunits leads to the formation of high molecular weight species of undefined size 
which cannot be resolved by SDS-polyacrylamide gel electrophoresis. 

[0157] When an antigen or antigenic determinant is linked to the non-natural 

molecular scaffold through a lysine residue, it may be advantageous to either 
substitute or delete one or both of the naturally resident lysine residues located at 
positions corresponding to positions 7 and 96 in SEQ ID NO: 134, as well as other 
lysine residues present in HBcAg variants. The elimination of these lysine residues 
results in the removal of binding sites for antigens or antigenic determinants which 
could disrupt the ordered array and should improve the quality and uniformity of 
the final vaccine composition. 

[0158] In many instances, when both of the naturally resident lysine residues at 

positions corresponding to positions 7 and 96 in SEQ ID NO: 134 are eliminated, 
another lysine will be introduced into the HBcAg as an attachment site for an 
antigen or antigenic determinant. Methods for inserting such a lysine residue are 
set out, for example, in Example 23 below. It will often be advantageous to 
introduce a lysine residue into the HBcAg when, for example, both of the naturally 
resident lysine residues at positions corresponding to positions 7 and 96 in SEQ 
ID NO: 134 are altered and one seeks to attach the antigen or antigenic 
determinant to the non-natural molecular scaffold using a heterobifunctional 
cross-linking agent. 

[0159] The C-terminus of the HBcAg has been shown to direct nuclear 

localization of this protein. (Eckhardt et aL, X Virol. 55:575-582 (1991).) 
Further, this region of the protein is also believed to confer upon the HBcAg the 
ability to bind nucleic acids. 
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[0160] In some embodiments, vaccine compositions of the invention will contain 

HBcAgs which have nucleic acid binding activity (e.g., which contain a naturally 
resident HBcAg nucleic acid binding domain). HBcAgs containing one or more 
nucleic acid binding domains are useful for preparing vaccine compositions which 
exhibit enhanced T-cell stimulatory activity. Thus, the vaccine compositions of 
the invention include compositions which contain HBcAgs having nucleic acid 
binding activity. Further included are vaccine compositions, as well as the use of 
such compositions in vaccination protocols, where HBcAgs are bound to nucleic 
acids. These HBcAgs may bind to the nucleic acids prior to administration to an 
individual or may bind the nucleic acids after administration. 

[0161] In other embodiments, vaccine compositions of the invention will contain 

HBcAgs from which the C-terminal region (e.g., amino acid residues 145-185 or 
150-185 of SEQ ID NO: 134) has been removed and do not bind nucleic acids. 
Thus, additional modified HBcAgs suitable for use in the practice of the present 
invention include C-terminal truncation mutants. Suitable truncation mutants 
include HBcAgs where 1, 5, 10, 15, 20, 25, 30, 34, 35, 36, 37, 38, 39 40, 41, 42 
or 48 amino acids have been removed from the C -terminus. 

[0162] HBcAgs suitable for use in the practice of the present invention also 

include N-terminal truncation mutants. Suitable truncation mutants include 
modified HBcAgs where 1, 2, 5, 7, 9, 10, 12, 14, 15, and 17 amino acids have 
been removed from the N-terminus. 

[0163] Further HBcAgs suitable for use in the practice of the present invention 

include N- and C-terminal truncation mutants. Suitable truncation mutants include 
HBcAgs where 1, 2, 5, 7, 9, 10, 12, 14, 15, and 17 amino acids have been 
removed from the N-terminus and 1, 5, 10,' 15, 20, 25, 30, 34, 35, 36, 37, 38, 39 
40, 41, 42 or 48 amino acids have been removed from the C-terminus. 

[0164] The invention further includes vaccine compositions comprising HBcAg 

polypeptides comprising, or alternatively consisting of, amino acid sequences 
which are at least 80%, 85%, 90%, 95%, 97%, or 99% identical to the above 
described truncation mutants. 
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[0165] As discussed above, in certain embodiments of the invention, a lysine 

residue is introduced as a first attachment site into a polypeptide which forms the 
non-natural molecular scaffold. In preferred embodiments, vaccine compositions 
of the invention are prepared using a HBcAg comprising, or alternatively 
consisting of, amino acids 1-144 or amino acids 1-149 of SEQ ID NO: 134 which 
is modified so that the amino acids corresponding to positions 79 and 80 are 
replaced with a peptide having the amino acid sequence of Gly-Gly-Lys-Gly-Gly 
(SEQ ID NO: 158) and the cysteine residues at positions 48 and 107 are either 
deleted or substituted with another amino acid residue, while the cysteine at 
position 61 is left in place. The invention further includes vaccine compositions 
comprising corresponding fragments of polypeptides having amino acid sequences 
shown in any of SEQ ID NOs:89-132 and 135-136 which also have the above 
noted amino acid alterations. 

[0166] The invention further includes vaccine compositions comprising fragments 

of a HBcAg comprising, or alternatively consisting of, an amino acid sequence 
other than that shown in SEQ ID NO: 134 from which a cysteine residue not 
present at corresponding location in SEQ ID NO: 134 has been deleted. One 
example of such a fragment would be a polypeptide comprising, or alternatively 
consisting of, amino acids amino acids 1-149 of SEQ ID NO: 132 where the 
cysteine residue at position 147 has been either substituted with another amino 
acid residue or deleted. 

[0167] The invention further includes vaccine compositions comprising HBcAg 

polypeptides comprising, or alternatively consisting of, amino acid sequences 
which are at least 80%, 85%, 90%, 95%, 97%, or 99% identical to amino acids 
1-144 or 1-149 of SEQ ID NO: 134 and corresponding subportions of a 
polypeptide comprising an amino acid sequence shown in any of SEQ ID 
NOs:89-132 or 134-136, as well as to amino acids 1-147 or 1-152 of SEQ ID 
NO:158. 

[0168] The invention also includes vaccine compositions comprising HBcAg 

polypeptides comprising, or alternatively consisting of, amino acid sequences 
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which are at least 80%, 85%, 90%, 95%, 97%, or 99% identical to amino acids 
36-240, 36-269, 44-240, 44-269, 36-305, and 44-305 of SEQ ID NO:137 or 
36-240, 36-269, 44-240, 44-269, 36-305, and 44-305 of SEQ ID NO: 138. 

[0169] Vaccine compositions of the invention may comprise mixtures of different 

HBcAgs. Thus, these vaccine compositions may be composed of HBcAgs which 
differ in amino acid sequence. For example, vaccine compositions could be 
prepared comprising a M wild-type" HBcAg and a modified HBcAg in which one 
or more amino acid residues have been altered (e.g., deleted, inserted or 
substituted). In most applications, however, only one type of a HBcAg, or at least 
HBcAgs having essentially equivalent first attachment sites, will be used because 
vaccines prepared using such HBcAgs will present highly ordered and repetitive 
arrays of antigens or antigenic determinants. Further, preferred vaccine 
compositions of the invention are those which present highly ordered and 
repetitive antigen arrays. 

[0170] The invention further includes vaccine compositions where the non-natural 

molecular scaffold is prepared using a HBcAg fused to another protein. As 
discussed above, one example of such a fusion protein is a UBcAg/FOS fusion. 
Other examples of HBcAg fusion proteins suitable for use in vaccine compositions 
of the invention include fusion proteins where an amino acid sequence has been 
added which aids in the formation and/or stabilization of HBcAg dimers and 
multimers. This additional amino acid sequence may be fused to either the N- or 
C-terminus of the HBcAg. One example, of such a fusion protein is a fusion of 
a HBcAg with the GCN4 helix region of Saccharomyces cerevisiae (GenBank 
Accession No. P03069 (SEQ ID NO: 154)). 

[0171] The helix domain of the GCN4 protein forms homodimers via 

non-covalent interactions which can be used to prepare and stabilize HBcAg 
dimers and multimers. 

[0172] In one embodiment, the invention provides vaccine compositions prepared 

using HBcAg fusions proteins comprising a HBcAg, or fragment thereof, with a 
GCN4 polypeptide having the sequence of amino acid residues 227 to 276 in SEQ 
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ID NO: 154 fused to the C-terminus. This GCN4 polypeptide may also be fused 
to the N-terminus of the HbcAg. 

[0173] HBcAg/src homology 3 (SH3) domain fusion proteins could also be used 

to prepare vaccine compositions of the invention. SH3 domains are relatively 
small domains found in a number of proteins which confer the ability to interact 
with specific proline-rich sequences in protein binding partners (see McPherson, 
Cell Signal 77:229-238 (1999). HBcAg/SH3 fusion proteins could be used in 
several ways. First, the SH3 domain could form a first attachment site which 
interacts with a second attachment site of the antigen or antigenic determinant. 
Similarly, a proline rich amino acid sequence could be added to the HBcAg and 
used as a first attachment site for an SH3 domain second attachment site of an 
antigen or antigenic determinant. Second, the SH3 domain could associate with 
proline rich regions introduced into HBcAgs. Thus, SH3 domains and proline rich 
SH3 interaction sites could be inserted into either the same or different HBcAgs 
and used to form and stabilized dimers and multimers of the invention. 

[0174] In other embodiments, a bacterial pilin, a subportion of a bacterial pilin, or 

a fusion protein which contains either a bacterial pilin or subportion thereof is used 
to prepare vaccine compositions of the invention. Examples of pilin proteins 
include pilins produced by Escherichia coli, Haemophilus influenzae, Neisseria 
meningitidis, Neisseria gonorrhoeae, Caulobacter crescentus, Pseudomonas 
stutzeri, and Pseudomonas aeruginosa. The amino acid sequences of pilin 
proteins suitable for use with the present invention include those set out in 
GenBank reports AJ000636 (SEQ ID NO: 139), AJ132364 (SEQ ID NO: 140), 
AP229646 (SEQ ID NO:141), AF051814 (SEQ ID NO: 142), AF051815 (SEQ 
ID NO: 143), and X00981 (SEQ ID NO: 155), the entire disclosures of which are 
incorporated herein by reference. 

[0175] Bacterial pilin proteins are generally processed to remove N-terminal 

leader sequences prior to export of the proteins into the bacterial periplasm. 
Further, as one skilled in the art would recognize, bacterial pilin proteins used to 
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prepare vaccine compositions of the invention will generally not have the naturally 
present leader sequence. 

[0176] One specific example of a pilin protein suitable for use in the present 

invention is theP-pilin ofE. coli (GenBank report AF237482 (SEQ ID NO: 144)). 
An example of a Type-1 E. coli pilin suitable for use with the invention is a pilin 
having the amino acid sequence set out in GenBank report P04128 (SEQ ID 
NO: 146), which is encoded by nucleic acid having the nucleotide sequence set out 
in GenBank report M27603 (SEQ ID NO: 145). The entire disclosures of these 
GenBank reports are incorporated herein by reference. Again, the mature form 
of the above referenced protein would generally be used to prepare vaccine 
compositions of the invention. Another example of a pilin protein is SEQ ID NO : 
184 , which is identical to SEQ ID NO: 146, except that in SEQ ID NO: 146, 
amino acid 20 is threonine, but in SEQ ID NO: 184, amino acid 20 is alanine. 

[0177] Bacterial pilins or pilin subportions suitable for use in the practice of the 

present invention will generally be able to associate to form non-natural molecular 
scaffolds. 

[0178] Methods for preparing pili and pilus-like structures in vitro are known in 

the art. Bullitt et al, Proc. Natl Acad. Sci. USA 93:12890-12895 (1996), for 
example, describe the in vitro reconstitution of E. coli P-pili subunits. Further, 
Eshdat et al, J. Bacteriol. 745:308-314 (1981) describe methods suitable for 
dissociating Type-1 pili of E. coli and the reconstitution of pili. In brief, these 
methods are as follows: pili are dissociated by incubation at 37°C in saturated 
guanidine hydrochloride. Pilin proteins are then purified by chromatography, after 
which pilin dimers are formed by dialysis against 5 raM tris(hydroxymethyl) 
aminomethane hydrochloride (pH 8.0). Eshdat et al also found that pilin dimers 
reassemble to form pili upon dialysis against the 5 mM tris(hydroxymethyl) 
aminomethane (pH 8.0) containing 5 mM MgCl 2 . 

[0179] Further, using, for example, conventional genetic engineering and protein 

modification methods, pilin proteins maybe modified to contain a first attachment 
site to which an antigen or antigenic determinant is linked through a second 
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attachment site. Alternatively, antigens or antigenic determinants can be directly 
linked through a second attachment site to amino acid residues which are naturally 
resident in these proteins. These modified pilin proteins may then be used in 
vaccine compositions of the invention. 

[0180] Bacterial pilin proteins used to prepare vaccine compositions of the 

invention may be modified in a manner similar to that described herein for HBcAg. 
For example, cysteine and lysine residues maybe either deleted or substituted with 
other amino acid residues and first attachment sites may be added to these 
proteins. Further, pilin proteins may either be expressed in modified form or may 
be chemically modified after expression. Similarly, intact pili may be harvested 
from bacteria and then modified chemically. 

[0181] In another embodiment, pili or pilus-like structures are harvested from 

bacteria (e.g., E. coif) and used to form vaccine compositions of the invention. 
One example of pili suitable for preparing vaccine compositions is the Type-1 pilus 
of E. coli, which is formed from pilin monomers having the amino acid sequence 
set out in SEQ ID NO: 146. 

[0182] A number of methods for harvesting bacterial pili are known in the art. 

Bullitt and Makowski (Biophys. J. 74:623-632 (1998)), for example, describe a 
pilus purification method for harvesting P-pili from E. coli. According to this 
method, pili are sheared from hyperpiliated E. coli containing a P-pilus plasmid 
and purified by cycles of solubilization and MgCl 2 (1 .0 M) precipitation. A similar 
purification method is set out below in Example 33. 

[0183] Once harvested, pili or pilus-like structures may be modified in a variety 

of ways. For example, a first attachment site can be added to the pili to which 
antigens or antigen determinants may be attached through a second attachment 
site. In other words, bacterial pili or pilus-like structures can be harvested and 
modified to form non-natural molecular scaffolds. 

[0184] Pili or pilus-like structures may also be modified by the attachment of 

antigens or antigenic determinants in the absence of a non-natural organizer. For 
example, antigens or antigenic determinants could be linked to naturally occurring 
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cysteine resides or lysine residues. In such instances, the high order and 
repetitiveness of a naturally occurring amino acid residue would guide the 
coupling of the antigens or antigenic determinants to the pili or pilus-like 
structures. For example, the pili or pilus-like structures could be linked to the 
second attachment sites of the antigens or antigenic determinants using a 
heterobifunctional cross-linking agent. 

[0185] When structures which are naturally synthesized by organisms (e.g., pili) 

are used to prepare vaccine compositions of the invention, it will often be 
advantageous to genetically engineer these organisms so that they produce 
structures having desirable characteristics. For example, when Type-1 pili ofi?. 
coli are used, the E. coli from which these pili are harvested may be modified so 
as to produce structures with specific characteristics. Examples of possible 
modifications of pilin proteins include the insertion of one or more lysine residues, 
the deletion or substitution of one or more of the naturally resident lysine residues, 
and the deletion or substitution of one or more naturally resident cysteine residues 
(e.g., the cysteine residues at positions 44 and 84 in SEQ ID NO: 146). 

[0186] Further, additional modifications can be made to pilin genes which result 

in the expression products containing a first attachment site other than a lysine 
residue (e.g., a FOS or JUN domain). Of course, suitable first attachment sites 
will generally be limited to those which do not prevent pilin proteins from forming 
pili or pilus-like structures suitable for use in vaccine compositions of the 
invention. 

[0187] Pilin genes which naturally reside in bacterial cells can be modified in vivo 

( e -g> 9 by homologous recombination) or pilin genes with particular characteristics 
can be inserted into these cells. For examples, pilin genes could be introduced into 
bacterial cells as a component of either a replicable cloning vector or a vector 
which inserts into the bacterial chromosome. The inserted pilin genes may also 
be linked to expression regulatory control sequences (e.g., a lac operator). 

[0188] In most instances, the pili or pilus-like structures used in vaccine 

compositions of the invention will be composed of single type of a pilin subunit 
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Pili or pilus-like structures composed of identical subunits will generally be used 
because they are expected to form structures which present highly ordered and 
repetitive antigen arrays. 

[0189] However, the compositions of the invention also include vaccines 

comprising pili or pilus-like structures formed from heterogenous pilin subunits. 
The pilin subunits which form these pili or pilus-like structures can be expressed 
from genes naturally resident in the bacterial cell or may be introduced into the 
cells. When a naturally resident pilin gene and an introduced gene are both 
expressed in a cell which forms pili or pilus-like structures, the result will generally 
be structures formed from a mixture of these pilin proteins. Further, when two or 
more pilin genes are expressed in a bacterial cell, the relative expression of each 
pilin gene will typically be the factor which determines the ratio of the different 
pilin subunits in the pili or pilus-like structures. 

[0190] When pili or pilus-like structures having a particular composition of mixed 

pilin subunits is desired, the expression of at least one of the pilin genes can be 
regulated by a heterologous, inducible promoter. Such promoters, as well as 
other genetic elements, can be used to regulate the relative amounts of different 
pilin subunits produced in the bacterial cell and, hence, the composition of the pili 
or pilus-like structures. 

[0191] In additional, while in most instances the antigen or antigenic determinant 

will be linked to bacterial pili or pilus-like structures by a bond which is not a 
peptide bond, bacterial cells which produce pili or pilus-like structures used in the 
compositions of the invention can be genetically engineered to generate pilin 
proteins which are fused to an antigen or antigenic determinant. Such fusion 
proteins which form pili or pilus-like structures are suitable for use in vaccine 
compositions of the invention. 

[0192] The inventors surprisingly found that bacterial Pili induced an antibody 

response dominated by the IgGl isotype in mince. This type of antibodies is 
indicative for a Th2 response. Moreover, antigens coupled to Pili also induced a 
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IgGl response indicating that coupling of antigens to Pili was sufficient for 
induction of antigen-specific Th2 responses. 

B. Construction of an Antigen or Antigenic Determinant with a 
Second Attachment Site 

[0193] The second element in the compositions of the invention is an antigen or 

antigenic determinant possessing at least one second attachment site capable of 
association through at least one non-peptide bond to the first attachment site of 
the non-natural molecular scaffold. The invention provides for compositions that 
vary according to the antigen or antigenic determinant selected in consideration 
of the desired therapeutic effect. Other compositions are provided by varying the 
molecule selected for the second attachment site. 

[0194] However, when bacterial pili, or pilus-like structures, pilin proteins are 

used to prepare vaccine compositions of the invention, antigens or antigenic 
determinants may be attached to pilin proteins by the expression of pilin/antigen 
fusion proteins. Antigen and antigenic determinants may also be attached to 
bacterial pili, or pilus-like structures, pilin proteins through non-peptide bonds. 

[0195] Antigens of the invention may be selected from the group consisting of the 

following: (a) proteins suited to induce an immune response against cancer cells; 
(b) proteins suited to induce an immune response against infectious diseases; (c) 
proteins suited to induce an immune response against allergens ,(d) proteins suited 
to induce an immune response in farm animals, and (e) fragments (e.g. , a domain) 
of any of the proteins set out in (a)-(d). 

[0196] In one specific embodiment of the invention, the antigen or antigenic 

determinant is one that is useful for the prevention of infectious disease. Such 
treatment will be useful to treat a wide variety of infectious diseases affecting a 
wide range of hosts, e.g., human, cow, sheep, pig, dog, cat, other mammalian 
species and non-mammalian species as well. Treatable infectious diseases are well 
known to those skilled in the art, examples include infections of viral etiology such 
as HIV, influenza, Herpes, viral hepatitis, Epstein Bar, polio, viral encephalitis, 
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measles, chicken pox, etc.; or infections of bacterial etiology such as pneumonia, 
tuberculosis, syphilis, etc.; or infections of parasitic etiology such as malaria, 
trypanosomiasis, leishmaniasis, trichomoniasis, amoebiasis, etc. Thus, antigens or 
antigenic determinants selected for the compositions of the invention will be well 
known to those in the medical art; examples of antigens or antigenic determinants 
include the following : the HIV antigens gp 1 40 and gp 1 60; the influenaza antigens 
hemagglutinin and neuraminidase, Hepatitis B surface antigen, circumsporozoite 
protein of malaria. 

[0197] In another specific embodiment, compositions of the invention are an 

immunotherapeutic that may be used for the treatment of allergies or cancer. 

[0198] The selection of antigens or antigenic determinants for compositions and 

methods of treatment for allergies would be known to those skilled in the medical 
art treating such disorders; representative examples of this type of antigen or 
antigenic determinant include the following: bee venom phospholipase A 2 , Bet v 
I (birch pollen allergen), 5 Dol m V (white-faced hornet venom allergen), Der p 
I (House dust mite allergen). 

[0199] The selection of antigens or antigenic determinants for compositions and 

methods of treatment for cancer would be known to those skilled in the medical 
art treating such disorders; representative examples of this type of antigen or 
antigenic determinant include the following: Her2 (breast cancer), GD2 
(neuroblastoma), EGF-R (malignant glioblastoma), CEA (medullary thyroid 
cancer), CD52 (leukemia). 

[0200] In a particular embodiment of the invention, the antigen or antigenic 

determinant is selected from the group consisting of: (a) a recombinant protein of 
HIV, (b) a recombinant protein of Influenza virus, (c) a recombinant protein of 
Hepatitis B virus, (d) a recombinant protein of Toxoplasma, (e) a recombinant 
protein of Plasmodium falciparum, (f) a recombinant protein of Plasmodium 
vivax, (g) a recombinant protein of Plasmodium ovale, (h) a recombinant protein 
of Plasmodium malariae, (i) a recombinant protein of breast cancer cells, (j) a 
recombinant protein of kidney cancer cells, (k) a recombinant protein of prostate 
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cancer cells, (1) a recombinant protein of skin cancer cells, (m) a recombinant 
protein of brain cancer cells, (n) a recombinant protein of leukemia cells, (o) a 
recombinant profiling, (p) a recombinant protein of bee sting allergy, (q) a 
recombinant proteins of nut allergy, (r) a recombinant proteins of food allergies, 
(s) recombinant proteins of asthma, (t) a recombinant protein of Chlamydia, and 
(u) a fragment of any of the proteins set out in (a)-(t). 
[0201] Once the antigen or antigenic determinant of the composition is chosen, 

at least one second attachment site may be added to the molecule in preparing to 
construct the organized and repetitive array associated with the non-natural 
molecular scaffold of the invention. Knowledge of what will constitute an 
appropriate second attachment site will be known to those skilled in the art. 
Representative examples of second attachment sites include, but are not limited 
to, the following: an antigen, an antibody or antibody fragment, biotin, avidin, 
strepavidin, a receptor, a receptor ligand, a ligand, a ligand-binding protein, an 
interacting leucine zipper polypeptide, an amino group, a chemical group reactive 
to an amino group; a carboxyl group, chemical group reactive to a carboxyl 
group, a sulfhydryl group, a chemical group reactive to a sulfhydryl group, or a 
combination thereof 

[0202] The association between the first and second attachment sites will be 

determined by the characteristics of the respective molecules selected but will 
comprise at least one non-peptide bond. Depending upon the combination of first 
and second attachment sites, the nature of the association may be covalent, ionic, 
hydrophobic, polar, or a combination thereof 

[0203] In one embodiment of the invention, the second attachment site may be the 

FOS leucine zipper protein domain or the JUN leucine zipper protein domain. 

[0204] In a more specific embodiment of the invention, the second attachment site 

selected is the FOS leucine zipper protein domain, which associates specifically 
with the JUN leucine zipper protein domain of the non-natural molecular scaffold 
of the invention. The association of the JUN and FOS leucine zipper protein 
domains provides a basis for the formation of an organized and repetitive antigen 
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or antigenic determinant array on the surface of the scaffold. The FOS leucine 
zipper protein domain may be fused in frame to the antigen or antigenic 
determinant of choice at either the amino terminus, carboxyl terminus or internally 
located in the protein if desired. 

[0205] Several FOS fusion constructs are provided for exemplary purposes. 

Human growth hormone (Example 4), bee venom phospholipase A 2 (PLA) 
(Example 9), ovalbumin (Example 10) and HIV gpl40 (Example 12). 

[0206] In order to simplify the generation of FOS fusion constructs, several 

vectors are disclosed that provide options for antigen or antigenic determinant 
design and construction (see Example 6). The vectors pAVl-4 were designed for 
the expression of FOS fusion in E. coli; the vectors pAV5 and pAV6 were 
designed for the expression of FOS fusion proteins in eukaryotic cells. Properties 
of these vectors are briefly described: 

[0207] 1 . pAVl : This vector was designed for the secretion of fusion proteins 

with FOS at the C-terminus into the E. coli periplasmic space. The gene of 
interest (g.o.i.) may be ligated into the StuI/NotI sites of the vector. 

[0208] 2. pAV2 : This vector was designed for the secretion of fusion proteins 

with FOS at the N-terminus into the E. coli periplasmic space. The gene of 
interest (g.o.i.) ligated into theNotl/EcoRV (orNotl/Hindlll) sites of the vector. 

[0209] 3. pAV3 : This vector was designed for the cytoplasmic production of 

fusion proteins With FOS at the C-terminus in E. coli. The gene of interest (g.o.i.) 
may be ligated into the EcoRV/NotI sites of the vector. 

[0210] 4. pAV4 : This vector is designed for the cytoplasmic production of 

fusion proteins withFOS at the N-terminus inE. coli. The gene of interest (g.o.i.) 
may be ligated into the Notl/EcoRV (or Notl/Hindlll) sites of the vector. The 
N-terminal methionine residue is proteolytically removed upon protein synthesis 
(Hirel etal, Proc. Natl. Acad ScL USA 55:8247-8251 (1989)). 

[0211] 5. pAV5 : This vector was designed for the eukaryotic production of 

fusion proteins with FOS at the C-terminus. The gene of interest (g.o.i.) may be 
inserted between the sequences coding for the hGH signal sequence and the FOS 
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domain by ligation into the Eco47III/NotI sites of the vector. Alternatively, a 
gene containing its own signal sequence may be fused to the FOS coding region 
by ligation into the StuI/NotI sites. 

[0212] 6. pAV6 : This vector was designed for the eukaryotic production of 

fusion proteins with FOS at the N-terminus. The gene of interest (g.o.i.) may be 
ligated into the Notl/StuI (or Notl/Hindlll) sites of the vector. 

[0213] As will be understood by those skilled in the art, the construction of a 

FO^-antigen or -antigenic determinant fusion protein may include the addition of 
certain genetic elements to facilitate production of the recombinant protein. 
Example 4 provides guidance for the addition of certain E. coli regulatory 
elements for translation, and Example 7 provides guidance for the addition of a 
eukaryotic signal sequence. Other genetic elements may be selected, depending 
on the specific needs of the practioner. 

[0214] The invention is also seen to include the production of the FOS- antigen or 

FC^S-antigenic determinant fusion protein either in bacterial (Example 5) or 
eukaryotic cells (Example 8). The choice of which cell type in which to express 
the fusion protein is within the knowledge of the skilled artisan, depending on 
factors such as whether post-translational modifications are an important 
consideration in the design of the composition. 

[0215] As noted previously, the invention discloses various methods for the 

construction of a FOS-antigen or FOS-antigenic determinant fusion protein 
through the use of the pAV vectors. In addition to enabling prokaryotic and 
eukaryotic expression, these vectors allow the practitioner to choose between N~ 
and C-terminal addition to the antigen of the FOS leucine zipper protein domain. 
Specific examples are provided wherein N- and C-terminal FOS fusions are made 
to PL A (Example 9) and ovalbumin (Example 10). Example 1 1 demonstrates the 
purification of the PL A and ovalbumin FOS fusion proteins. 

[0216] In a more specific embodiment, the invention is drawn to an antigen or 

antigenic determinant encoded by the HIV genome. More specifically, the HIV 
antigen is gp 1 40. As provided for in Examples 11-15, HIV gp 140 may be created 
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with aFOS leucine zipper protein domain and the fusion protein synthesized and 
purified for attachment to the non-natural molecular scaffold of the invention. As 
one skilled in the art would know, other HIV antigens or antigenic determinants 
may be used in the creation of a composition of the invention. 
[0217] In a more specific embodiment of the invention, the second attachment site 

selected is a cysteine residue, which associates specifically with a lysine residue of 
the non-natural molecular scaffold of the invention. The chemical linkage of the 
lysine residue (Lys) and cysteine residue (Cys) provides a basis for the formation 
of an organized and repetitive antigen or antigenic determinant array on the 
surface of the scaffold. The cysteine residue may be engineered in frame to the 
antigen or antigenic determinant of choice at either the amino terminus, carboxyl 
terminus or internally located in the protein if desired. By way of example, PLA 
and HIV gp 1 40 are provided with a cysteine residue for linkage to a lysine residue 
first attachment site. 

C. Preparation of the AlphaVaccine Particles 

[0218] The invention provides novel compositions and methods for the 

construction of ordered and repetitive antigen arrays. As one of skill in the art 
would know, the conditions for the assembly of the ordered and repetitive antigen 
array depend to a large extent on the specific choice of the first attachment site of 
the non-natural molecular scaffold and the specific choice of the second 
attachment site of the antigen or antigenic determinant. Thus, practitioner choice 
in the design of the composition (i.e. , selection of the first and -second attachment 
sites, antigen and non-natural molecular scaffold) will determine the specific 
conditions for the assembly of the AlphaVaccine particle (the ordered and 
repetitive antigen array and non-natural molecular scaffold combined). 
Information relating to assembly of the AlphaVaccine particle is well within the 
working knowledge of the practitioner, and numerous references exist to aid the 
practitioner (e.g., Sambrook, J. et aL, eds., Molecular CLONING, A 
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LABORATORY Manual, 2nd. edition, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y. (1989); Ausubel, F. et al 9 eds., CURRENT PROTOCOLS IN 
Molecular Biology, John H. Wiley & Sons, Inc. (1997); Celis, J., ed., CELL 
BIOLGY, Academic Press, 2 nd edition, (1998); Harlow, E. and Lane, D., 
"Antibodies: ALaboratory Manual," Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N.Y. (1988), all of which are incorporated herein by reference. 

[0219] In a specific embodiment of the invention, the JUN and FOS leucine zipper 

protein domains are utilized for the first and second attachment sites of the 
invention, respectively. In the preparation of AlphaVaccine particles, antigen must 
be produced and purified under conditions to promote assembly of the ordered 
and repetitive antigen array onto the non-natural molecular scaffold. In the 
particular JUNIFOS leucine zipper protein domain embodiment, theFOS-antigen 
or FCtf-antigenic determinant should be treated with a reducing agent (e.g., 
Dithiothreitol (DTT)) to reduce or eliminate the incidence of disulfide bond 
formation (Example 15). 

[0220] For the preparation of the non-natural molecular scaffold (i.e., 

recombinant Sinbis virus) of the JUNIFOS leucine zipper protein domain 
embodiment, recombinant E2-JUN viral particles should be concentrated, 
neutralized and treated with reducing agent (see Example 16). 

[0221] Assembly of the ordered and repetitive antigen array in the JUNIFOS 

embodiment is done in the presence of a redox shuffle. T12-JUN viral particles are 
combined with a 240 fold molar excess of FC^-antigen or FO^-antigenic 
determinant for 10 hours at 4°C. Subsequently, the AlphaVaccine particle is 
concentrated and purified by chromatography (Example 16). 

[0222] In another embodiment of the invention, the coupling of the non-natural 

molecular scaffold to the antigen or antigenic determinant may be accomplished 
by chemical cross-linking. In a specific embodiment, the chemical agent is a 
heterobifunctional cross-linking agent such as s-maleimidocaproic acid N- 
hydroxysuccinimideester(Tanimorie^a/., J Pharm. Dyn. 4:812 (1981); Fujiwara 
etal, J. Immunol. Meth. 45: 195 (1981)), which contains (1) a succinimide group 
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reactive with amino groups and (2) a maleimide group reactive with SH groups. 
A heterologous protein or polypeptide of the first attachment site may be 
engineered to contain one or more lysine residues that will serve as a reactive 
moiety for the succinimide portion of the heterobifunctional cross-linking agent. 
Once chemically coupled to the lysine residues of the heterologous protein, the 
maleimide group of the heterobifunctional cross-linking agent will be available to 
react with the SH group of a cysteine residue on the antigen or antigenic 
determinant. Antigen or antigenic determinant preparation in this instance may 
require the engineering of a cysteine residue into the protein or polypeptide chosen 
as the second attachment site so that it may be reacted to the free maleimide 
function on the cross-linking agent bound to the non-natural molecular scaffold 
first attachment sites. Thus, in such an instance, the heterobifunctional 
cross-linking agent binds to a first attachment site of the non-natural molecular 
scaffold and connects the scaffold to a second binding site of the antigen or 
antigenic determinant. 

3 . Compositions, Vaccines, and the Administration Thereof, and Methods of 
Treatment 

[0223] In one embodiment, the invention provides vaccines for the prevention of 

infectious diseases in a wide range of species, particularly mammalian species such 
as human, monkey, cow, dog, cat, horse, pig, etc. Vaccines maybe designed to 
treat infections of viral etiology such as HIV, influenza, Herpes, viral hepatitis, 
Epstein Bar, polio, viral encephalitis, measles, chicken pox, etc.; or infections of 
bacterial etiology such as pneumonia, tuberculosis, syphilis, etc.; or infections of 
parasitic etiology such as malaria, trypanosomiasis, leishmaniasis, trichomoniasis, 
amoebiasis, etc. 

[0224] In another embodiment, the invention provides vaccines for the prevention 

of cancer in a wide range of species, particularly mammalian species such as 
human, monkey, cow, dog, cat, horse, pig, etc. Vaccines may be designed to treat 
all types of cancer: lymphomas, carcinomas, sarcomas, melanomas, etc. 
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[0225] In another embodiment of the invention, compositions of the invention 

may be used in the design of vaccines for the treatment of allergies. Antibodies 
of the IgE isotype are important components in allergic reactions. Mast cells bind 
IgE antibodies on their surface and release histamines and other mediators of 
allergic response upon binding of specific antigen to the IgE molecules bound on 
the mast cell surface. Inhibiting production of IgE antibodies, therefore, is a 
promising target to protect against allergies. This should be possible by attaining 
a desired T helper cell response. T helper cell responses can be divided into type 
1 (T H 1) and type 2 (T H 2) T helper cell responses (Romagnani, Immunol Today 
75:263-266 (1997)). T H 1 cells secrete interferon-gamma and other cytokines 
which trigger B cells to produce protective IgG antibodies. In contrast, a critical 
cytokine produced by T H 2 cells is IL-4, which drive B cells to produce IgE. In 
many experimental systems, the development of T H 1 and T H 2 responses is 
mutually exclusive sinceT H l cells suppress the induction of T H 2 cells and vice 
versa. Thus, antigens that trigger a strong T H 1 response simultaneously suppress 
the development of T H 2 responses and hence the production of IgE antibodies. 
Interestingly, virtually all viruses induce a T H 1 response in the host and fail to 
trigger the production of IgE antibodies (Coutelier et ah, J. Exp. Med. 765:64-69 
(1987)). This isotype pattern is not restricted to live viruses but has also been 
observed for inactivated or recombinant viral particles (Lo-Man et al , Eur. J. 
Immunol 28:1401-1407 (1998)). Thus, by using the processes of the invention 
{e.g., AlphaVaccine Technology), viral particles can be decorated with various 
allergens and used for immunization. Due to the resulting "viral structure" of the 
allergen, a T H 1 response will be elicited, "protective" IgG antibodies will be 
produced, and the production of IgE antibodies which cause allergic reactions will 
be prevented. Since the allergen is presented by viral particles which are 
recognized by a different set of helper T cells than the allergen itself, it is likely 
that the allergen-specific IgG antibodies will be induced even in allergic individuals 
harboring pre-existing T H 2 cells specific for the allergen. The presence of high 
concentrations of IgG antibodies may prevent binding of allergens to mast cell 
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bound IgE, thereby inhibiting the release of histamine. Thus, presence of IgG 
antibodies may protect from IgE mediated allergic reactions. Typical substances 
causing allergies include: grass, ragweed, birch or mountain cedar pollens, house 
dust, mites, animal danders, mold, insect venom or drugs (e.g., penicillin). Thus, 
immunization of individuals with allergen-decorated viral particles should be 
beneficial not only before but also after the onset of allergies. Food allergies are 
also very common, and immunization of subjects with particles decorated with 
food allergens should be useful for the treatment of these allergies. 

[0226] In another embodiment, the invention relates to the induction of specific 

Th type 2 (Th2) cells. The inventors surprisingly found that bacterial Pili induce 
an antibody response dominated by the IgGl isotype in mice, indicative of a Th2 
response. Antigens coupled to Pili also induced a IgGl response indicating that 
coupling of antigens to Pili was sufficient for induction of antigen-specific Th2 
response. Many chronic diseases in humans an animals, such as arthritis, colitis, 
diabetes and multiple sclerosis are dominated by Thl response, where T cells 
secrete IFNy and other pro-inflammatory cytokines precipitating disease. By 
contrast, Th2 cells secrete 11-4, 11-13 and also 11-10. The latter cytokine is usually 
associated with immunosuppression and there is good evidence that specific Th2 
cells can suppress chronic diseases^ such as arthritis, colitis, diabetes and multiple 
sclerosis in vivo. Thus, induction of antigen-specific Th2 cells is desirable for the 
treatment of such chronic diseases. 

[0227] It is known that induction of therapeutic self-specific antibodies may allow 

treating a variety of diseases. It is, e.g., known that anti-TNF antibodies can 
ameliorate symptoms in arthritis or colitis and antibodies specific for the AJ3- 
peptide may remove plaques from the brain of Alzheimers patients. It will usually 
be beneficial for the patient if such antibodies can be induced in the absence of a 
pro-inflammatory Thl response. Thus, self antigens coupled to Pili that induce 
a strong antibody response but no Thl response may be optimal for such 
immunotherapy. 
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[0228] In a preferred embodiment, the antigen is the amyloid beta peptide (Ap W2 ) 

(DAEFRHDSGYEVHHQKL VFFAEDVGSNKGAIIGLMVGGVVIA (SEQ ID 
NO: 174), or a fragment thereof: The amyloid beta protein is SEQ ID NO: 172. 
The amyloid beta precursor protein is SEQ ID NO: 173. 

[0229] The amyloid B peptide (Ap 1>42 ) has a central role in the neuropathology of 

Alzheimers disease. Region specific, extracellular accumulation of Ap peptide is 
accompanied by microgliosis, cytoskeletal changes, dystrophic neuritis and 
synaptic loss. These pathological alterations are thought to be linked to the 
cognitive decline that defines the disease. 

[0230] In a mouse model of Alzheimer disease, transgenic animals engineered to 

produce Ap^ (PDAPP-mice), develop plaques and neuron damage in their 
brains. Recent work has shown immunization of young PDAPP-mice, using Ap x _ 
42, resulted in inhibition of plaque formation and associated dystrophic neuritis 
(Schenk, D. et al, Nature 400:113-11 (1999)). 

[0231] Furthermore immunization of older PDAPP mice that had already 

developed AD-like neuropathologies, reduced the extent and progression of the 
neuropathologies. The immunization protocol for these studies was as follows; 
peptide was dissolved in aqueous buffer and mixed 1 : 1 with complete Freunds 
adjuvant (for primary dose) to give a peptide concentration of lOOpig/dose. 
Subsequent boosts used incomplete Freunds adjuvant. Mice received 11 
immunizations over an 1 1 month period. Antibodies titres greater than 1:10 000 
were achieved and maintained. Hence, immunization may be an effective 
prophylactic and therapeutic action against Alzheimer disease. 

[0232] In another study, peripherally administered antibodies raised against AP W2 , 

were able to cross the blood-brain barrier, bind Ap peptide, and induce clearance 
of pre-existing amyloid (Bard, F. etal, Nature Medicine 6: 916-19 (2000)). This 
study utilized either polyclonal antibodies raised against Ap M2 , or monoclonal 
antibodies raised against synthetic fragments derived from different regions of Ap. 
Thus induction of antibodies can be considered as a potential therapeutic 
treatment for Alzheimer disease. 
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[0233] In another more specific embodiment, the invention is drawn to vaccine 

compositions comprising at least one antigen or antigenic determinant encoded by 
an Influenza viral nucleic acid, and the use of such vaccine compositions to elicit 
immune responses. In an even more specific embodiment, the Influenza antigen 
or antigenic determinant may be an M2 protein (e.g., an M2 protein having the 
amino acids shown in SEQ ID NO: 171, GenBank Accession No. P06821, or in 
SEQIDNO: 170, PIR Accession No. MFIV62, or fragment thereof (e.g., amino 
acids from about 2 to about 24 in SEQ ID NO: 171, the amino acid sequence in 
SEQ ID NO: 170. Further, influenza antigens or antigenic determinants may be 
coupled to pili or pilus-like structures. Portions of an M2 protein (e.g., an M2 
protein having the amino acid sequence in SEQ ID NO: 170), as well as other 
proteins against which an immunological response is sought, suitable for use with 
the invention may comprise, or alternatively consist of, peptides of any number of 
amino acids in length but will generally be at least 6 amino acids in length (e.g., 
peptides 6, 7, 8, 9, 10, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 85, 90, 95, or 97 amino acids in length). 

[0234] In an even more specific embodiment, the Influenza antigen or antigenic 

determinant may be an M2 protein (e.g., an M2 protein having the amino acids 
shown in SEQ ID NO: 170, GenBank Accession No. P06821, or in SEQ ID NO: 
212, PIR Accession No. MFIV62, or fragment thereof (e.g., amino acids from 
about 2 to about 24 in SEQ ID NO: 171, the amino acid sequence in SEQ ID NO: 
170). 

[0235] As would be understood by one of ordinary skill in the art, when 

compositions of the invention are administered to an individual, they may be in a 
composition which contains salts, buffers, adjuvants, or other substances which 
are desirable for improving the efficacy of the composition. Examples of materials 
suitable for use in preparing pharmaceutical compositions are provided in 
numerous sources including REMINGTON'S PHARMACEUTICAL SCIENCES (Osol, A, 
ed., Mack Publishing Co., (1980)). 
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[0236] Compositions of the invention are said to be "pharmacologically accept- 

able" if their administration can be tolerated by a recipient individual. Further, the 
compositions of the invention will be administered in a "therapeutically effective 
amount" (i.e., an amount that produces a desired physiological effect). 

[0237] The compositions of the present invention may be administered by various 

methods known in the art, but will normally be administered by injection, infusion, 
inhalation, oral administration, or other suitable physical methods. The 
compositions may alternatively be administered intramuscularly, intravenously, or 
subcutaneously. Components of compositions for administration include sterile 
aqueous (e.g., physiological saline) or non-aqueous solutions and suspensions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Carriers or occlusive dressings can be used to increase skin permeability and 
enhance antigen absorption. 

[0238] The present invention also provides a composition comprising a bacterial 

pilin polypeptide to which an antigen or antigenic determinant has been attached 
by a covalent bond. 

[0239] The present invention also provides a composition comprising a fragment 

of a bacteriophage coat protein to which an antigen or antigenic determinant has 
been attached by a covalent bond. 

[0240] The present invention also provides a composition comprising (a) non- 

natural molecular scaffold comprising (i) a core particle selected from the group 
consisting of (1) a bacterial pilus or pilin protein; and (2) a recombinant form of 
a bacterial pilus or pilin protein; and (ii) an organizer comprising at least one first 
attachment site, wherein the organizer is connected to the core particle by at least 
one covalent bond; and (b) an antigen or antigenic determinant with at least one 
second attachment site, the second attachment site being selected from the group 
consisting of (i) an attachment site not naturally occurring with the antigen or 
antigenic determinant; and (ii) an attachment site naturally occurring with the 
antigen or antigenic determinant, wherein the second attachment site is capable of 
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association through at least one non-peptide bond to the first attachment site; and 
wherein the antigen or antigenic determinant and the scaffold interact through the 
association to form an ordered and repetitive antigen array. 

[0241] The present invention also provides a composition comprising (a) a non- 

natural molecular scaffold comprising (i) a core particle selected from the group 
consisting of: (1) a bacterial pilus; and (2) a recombinant form of a bacterial pilus; 
and (ii) an organizer comprising at least one first attachment site, wherein the 
organizer is connected to the core particle by at least one covalent bond; and (b) 
an antigen or antigenic determinant with at least one second attachment site, the 
second attachment site being selected from the group consisting of (i) an 
attachment site not naturally occurring with the antigen or antigenic determinant; 
and (ii) an attachment site naturally occurring with the antigen or antigenic 
determinant, wherein the second attachment site is capable of association through 
at least one non-peptide bond to the first attachment site; and wherein the antigen 
or antigenic determinant and the scaffold interact through the association to form 
an ordered and repetitive antigen array. 

[0242] The present invention also provides a composition comprising (a) a non- 

natural molecular scaffold comprising (i) a virus-like particle that is a dimer or a 
multimer of a polypeptide comprising amino acids 1-147 of SEQ ID NO: 158 as 
core particle; and (ii) an organizer comprising at least one first attachment site, 
wherein the organizer is connected to the core particle by at least one covalent 
bond; and (b) an antigen or antigenic determinant with at least one second 
attachment site, the second attachment site being selected from the group 
consisting of (i) an attachment site not naturally occurring with the antigen or 
antigenic determinant; and (ii) an attachment site naturally occurring with the 
antigen or antigenic determinant, wherein the second attachment site is capable of 
association through at least one non-peptide bond to the first attachment site; and 
wherein the antigen or antigenic determinant and the scaffold interact through the 
association to form an ordered and repetitive antigen array. 
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[0243] The present invention also provides a pharmaceutical composition 

comprising any of compositions of the present invention, and a pharmaceutical^ 
acceptable carrier. 

[0244] The present invention also provides a vaccine compo sition comprising any 

of compositions of the present invention. The vaccine composition may further 
comprise at least one adjuvant. The present invention also provides a method of 
immunizing, comprising administering to a subject a vaccine composition of the 
present invention. 

[0245] The present invention also provides a composition comprising (a) a non- 

natural molecular scaffold comprising (i) Hepatitis B virus capsid protein 
comprising an amino acid sequence selected from the group consisting of (1) the 
amino acid sequence of SEQ ED NO: 8 9, (2) the amino acid sequence of SEQ ID 
NO:90 (3) the amino acid sequence of SEQ ID NO:93, (4) the amino acid 
sequence of SEQ ID NO:98, (5) the amino acid sequence of SEQ ID NO:99, (6) 
the amino acid sequence of SEQ ID NO: 102, (7)the amino acid sequence of SEQ 
ID NO: 104, (8) the amino acid sequence of SEQ ID NO: 105, (9) the amino acid 
sequence of SEQ ID NO: 106, (10) the amino acid sequence of SEQ ID NO: 119, 
(1 1) the amino acid sequence of SEQ ID NO: 120, (12) the amino acid sequence 
of SEQ ID NO: 123, (13) the amino acid sequence of SEQ ID NO: 125, (14) the 
amino acid sequence of SEQ ID NO: 13 1, (15) the amino acid sequence of SEQ 
ID NO: 132, (16) the amino acid sequence of SEQ ID NO: 134, (17) the amino 
acid sequence of SEQ ID NO: 157, and (18) the amino acid sequence of SEQ ID 
NO: 158; and (ii) an organizer comprising at least one first attachment site, 
wherein the organizer is connected to the core particle by at least one covalent 
bond; and (b) an antigen or antigenic determinant with at least one second 
attachment site, the second attachment site being selected from the group 
consisting of (i) an attachment site not naturally occurring with the antigen or 
antigenic determinant; and (ii) an attachment site naturally occurring with the 
antigen or antigenic determinant, wherein the second attachment site is capable of 
association through at least one non-peptide bond to the first attachment site; and 
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wherein the antigen or antigenic determinant and the scaffold interact through the 
association to form an ordered and repetitive antigen array. Preferably, the 
organizer is a polypeptide or residue thereof, wherein the second attachment site 
is a polypeptide or residue thereof, and wherein the first attachment site is a lysine 
residue and the second attachment site is a cysteine residue. Preferably, one or 
more cysteine residues of the Hepatitis B virus capsid protein have been either 
deleted or substituted with another amino acid residue. Preferably, the cysteine 
residues corresponding to amino acids 48 and 107 in SEQ ID NO: 134 have been 
either deleted or substituted with another amino acid residue. 

[0246] The present invention also provides a composition comprising: (1) a non- 

natural molecular scaffold comprising (i) a core particle selected from the group 
consisting of (1) abacterial pilus, and (2) a recombinant form of abacterial pilus 
or pilin protein; and (ii) an organizer comprising at least one first attachment site, 
wherein the organizer is connected to the core particle by at least one covalent 
bond; and (2) an antigen or antigenic determinant with at least one second 
attachment site, the second attachment site being selected from the group 
consisting of (i) an attachment site not naturally occurring with the antigen or 
antigenic determinant, and (ii) an attachment site naturally occurring with the 
antigen or antigenic determinant, wherein the second attachment site is capable of 
association through at least one non-peptide bond to the first attachment site, 
wherein the antigen or antigenic determinant and the scaffold interact through the 
association to form an ordered and repetitive antigen array, and wherein the 
antigen or antigenic determinant is selected from the group consisting of an 
influenza M2 peptide, the GRA2 polypeptide, the DP 178c peptide, the tumor 
necrosis factor polypeptide, a tumor necrosis factor peptide, the B2 peptide, the 
D2 peptide, and the Ap peptide. 

[0247] In the compositions and vaccines of the present invention, for a covalent 

bond between a first and second attachment site, the covalent bond is preferably 
not a peptide bond. 
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[0248] If a bacterial pilus is present in a composition or vaccine of the present 

invention, the pilus is preferably a Type-1 pilus of Eschericia coli. More 
preferably, pilin subunits of the Type-1 pilus comprises the amino acid sequence 
shown in SEQ ID NO: 146. Preferably, the bacterial pilus and the antigen or 
antigen determinant are attached via either a naturally or non-naturally occurring 
attachment. Preferably, the first attachment site will be a lysine residue, while hte 
second attachment site will be a cysteine residue present or engineered on the 
antigen If the attachment comprises interacting leucine zipper polypeptides, the 
polypeptides are preferably JUN and/or FOS leucine zipper polypeptides. 

[0249] In the compositions and vaccines of the present invention that comprise 

an organizer having a first attachment site, attached to the second attachment site 
on the antigen, the organizer is preferably a polypeptide or a residue thereof, and 
the second attachment site is preferably a polypeptide or a residue thereof More 
preferably, the first and/or the second attachment sites comprise an antigen and an 
antibody or antibody fragment thereto, biotin and avidin, strepavidin and biotin, 
a receptor and its ligand, a ligand-binding protein and its ligand, interacting leucine 
zipper polypeptides, an amino group and a chemical group reactive thereto, a 
carboxyl group and a chemical group reactive thereto, a sulfhydryl group and a 
chemical group reactive thereto, or a combination thereof More preferably, the 
first attachment site is an amino group, and the second attachment site is a 
sulfhydryl group. 

[0250] In the compositions and vaccines of the present invention, the antigen is 

preferably selected from the group consisting of a protein suited to induce an 
immune response against cancer cells, a protein suited to induce an immune 
response against infectious diseases, a protein suited to induce an immune 
response against allergens, and a protein suited to induce an immune response in 
farm animals. Preferably, the antigen induces an immune response against one or 
more allergens. More preferably, the antigen is a recombinant protein of HIV, a 
recombinant protein of Influenza virus, a recombinant protein of Hepatitis C virus, 
a recombinant protein of Toxoplasma, a recombinant protein of Plasmodium 
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falciparum, a recombinant protein of Plasmodium vivax, a recombinant protein of 
Plasmodium ovale, a recombinant protein of Plasmodium malariae, a recombinant 
protein of breast cancer cells, a recombinant protein of kidney cancer cells, a 
recombinant protein of prostate cancer cells, a recombinant protein of skin cancer 
cells, a recombinant protein of brain cancer cells, a recombinant protein of 
leukemia cells, a recombinant profiling, a recombinant protein of bee sting allergy, 
a recombinant protein of nut allergy, a recombinant protein of food allergies, or 
a recombinant protein of asthma, or a recombinant protein of Chlamydia. 

[0251] In the method of immunizing provided by the present invention, the 

immunization produces an immune response in the subject. Preferably, the 
immunization produces a humoral immune response, a cellular immune response, 
a humoral and a cellular immune response, or a protective immune response. 

[0252] In the compositions and vaccines of the present invention, the antigen or 

antigenic determinant is attached to the non-natural molecular scaffold through the 
first attachment site, to form an antigen array or antigenic determinant array. 
Preferably, the array is ordered and/or repetitive. 

[0253] In the compositions and vaccines of the present invention, the first and/or 

the second attachment sites are preferably attached via either a non-naturally 
occurring attachment, or by an attachment comprising interacting leucine zipper 
polypeptides. More preferably, the interacting leucine zipper polypeptides are 
JUN and/or FOS leucine zipper polypeptides. 

[0254] The present invention also provides a method for making the compositions 

and vaccines of the present invention, comprising combining the antigen or 
antigenic determinant with the non-natural molecular scaffold through the first 
attachment site and organizer present on the non-natural molecular scaffold. 

[0255] In addition to vaccine technologies, other embodiments of the invention 

are drawn to methods of medical treatment for cancer, allergies, and chronic 
diseases. 

[0256] Following is a protocol for analyzing pili by SDS-PAGE Analysis. Add 

trichloroacetic acid to a final concentration of 10% to the pili solution containing 
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approx. 50 ug of pili. Vortex and incubate for 10 minutes on ice. Centrifuge at 
maximal speed for 5 minutes in a microcentrifuge. Discard the supernatant and 
resuspend the pellet in 50 ul of a 8.5 M guanidiniumhydrochloride, pH 3 solution. 
Heat the sample for 15 minutes at 70°C. Precipitate the protein by adding 1 .5 ml 
of Ethanol precooled at -20°C, and centrifuge 5 minutes at RT at maximal speed. 
Resuspend the pellet in 15 ul of a 10 mM Tris, pH 8 buffer. Add SDS-PAGE 
sample buffer, vortex shortly and heat the sample 10 minutes at 100°C. Load the 
sample on a 12% gel. 

EXAMPLES 

[0257] Enzymes and reagents used in the experiments that follow included: T4 

DNAligase obtained from New England Biolabs; Taq DNA Polymerase, QIAprep 
Spin Plasmid Kit, QIAGEN Plasmid Midi Kit, QiaExII Gel Extraction Kit, 
QIAquick PCR Purification Kit obtained from QIAGEN; QuickPrep Micro 
mRNA Purification Kit obtained from Pharmacia; Superscript One-step RT PCR 
Kit, fetal calf serum (FCS), bacto-tryptone and yeast extract obtained from Gibco 
BRL; Oligonucleotides obtained from Micro synth (Switzerland); restriction 
endonucleases obtained from Boehringer Mannheim, New England Biolabs or 
MBI Fermentas; Pwo polymerase and dNTPs obtained from Boehringer 
Mannheim. HP-1 medium was obtained from Cell culture technologies 
(Glattbrugg, Switzerland). All standard chemicals were obtained from 
Fluka-Sigma-Aldrich, and all cell culture materials were obtained from TPP. 

[0258] DNA manipulations were carried out using standard techniques. DNA 

was prepared according to manufacturer instruction either from a 2 ml bacterial 
culture using the QIAprep Spin Plasmid Kit or from a 50 ml culture using the 
QIAGEN Plasmid Midi Kit. For restriction enzyme digestion, DNA was 
incubated at least 2 hours with the appropriate restriction enzyme at a 
concentration of 5-10 units (U) enzyme per mg DNA under manufacturer 
recommended conditions (buffer and temperature). Digests with more than one 
enzyme were performed simultaneously if reaction conditions were appropriate for 
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all enzymes, otherwise consecutively. DNA fragments isolated for further 
manipulations were separated by electrophoresis in a 0.7 to 1.5% agarose gel, 
excised from the gel and purified with the QiaExII Gel Extraction Kit according 
to the instructions provided by the manufacturer. For ligation of DNA fragments, 
100 to 200 pg of purified vector DNA were incubated overnight with a threefold 
molar excess of the insert fragment at 1 6 ° C in the presence of 1 U T4 DNA ligase 
in the buffer provided by the manufacturer (total volume: 10-20 |il). An aliquot 
(0.1 to 0.5 jil) of the ligation reaction was used for transformation ofE. coli 
XL 1 -Blue (Stratagene). Transformation was done by electroporation using a 
Gene Pulser (BioRAD) and 0. 1 cm Gene Pulser Cuvettes (BioRAD) at 200 Q, 25 
liF, 1 .7 kV. After electroporation, the cells were incubated with shaking for 1 h 
in 1 ml SOB. medium (Miller, 1972) before plating on selective S.O.B. agar. 

EXAMPLE 1: 

Insertion of the JUN amphiphatic helix domain within E2 
[0259] In the vector pTE5^2J (Hahn et al., Proa Natl. Acad. ScL USA 

SP:2679-2683 ? (1992)), MM and a BstEll restriction enzyme sites were 
introduced between codons 71 (Gin) and 74 (Thr) of the structural protein E2 
coding sequence, resulting in vector pTE5 N 2JBM. Introduction of these 
restriction enzymes sites was done by PCR using the following oligonucleotides: 

Oligo 1: 

E2insBstEII/BssHII: 

5 '-ggggACGCGTGCAGCAggtaaccaccgTTAAAGAAGGCACC-3 ' (SEQ ID 
NO:l) 

Oligo 2: 
E2insMluIStuI: 

5 '-cggtggttaccTGCTGCACGCGTTGCTTAAGCGACATGTAGCGG-3 ' (SEQ 
IDNO:2) 

Oligo 3: 

E2insStuI: 5'-CCATGAGGCCTACGATACCC-3' (SEQIDNO:3) 



OUgo4: 
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E2insBssHII: 5 '-GGCACTCACGGCGCGCTTTACAGGC-3 ' (SEQ ID NO: 4) 

[0260] For the PGR reaction, 100 pmol of each oligo was used with 5 ng of the 

template DNA in a 100 |il reaction mixture containing 4 units of Taq or Pwo 
polymerase, 0. 1 mM dNTPs and 1.5 mM MgS0 4 . All DNA concentrations were 
determined photometrically using the GeneQuant apparatus (Pharmacia). 
Polymerase was added directly before starting the PGR reaction (starting point 
was 95 °C). Temperature cycling was done in the following manner and order: 
95°C for 2 minutes; 5 cycles of 95°C (45 seconds), 53 °C (60 seconds), 72°C 
(80 seconds); and 25 cycles of 95°C (45 seconds), 57°C (60 seconds), 72°C 
(80 seconds). 

[0261] The two PGR fragments were analyzed and purified by agarose 

gelelectrophoresis. Assembly PCR of the two PCR fragments using oligo 3 and 
4 for amplification was carried out to obtain the final construct. 

[0262] For the assembly PCR reaction, 100 pmol of each oligo was used with 

2 ng of the purified PCR fragments in a 1 00 jul reaction mixture containing 4 units 
of Taq or Pwo polymerase, 0.1 mM dNTPs and 1.5 mM MgS0 4 . All DNA 
concentrations were determined photometrically using the GeneQuant apparatus 
(Pharmacia). Polymerase was added directly before starting the PCR reaction 
(starting point was 95 ° C). Temperature cycling was done in the following manner 
and order: 95 °C for 2 minutes; 5 cycles of 95 °C (45 seconds), 57 °C 
(60 seconds), 72°C (90 seconds); and 25 cycles of 95°C (45 seconds), 59°C 
(60 seconds), 72 °C (90 seconds). 

[0263] The final PCR product was purified using Qia spin PCR columns (Qiagen) 

and digested in an appropriate buffer using 10 units each of BssHII and StuI 
restriction endonucleases for 12 hours at 37°C. The DNA fragments were 
gel-purified and ligated into BssHII/StuI digested and gel-purified pTE5 '2 J vector 
(Hahn et aL, Proc. Natl Acad Set USA 5P:2679-2683). The correct insertion 
of the PCR product was first analyzed by BstEII and Mlul restriction analysis and 
then by DNA sequencing of the PCR fragment. 
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[0264] The DNA sequence coding for the JUN amphiphatic helix domain was 

PCR-amplified from vector pJuFo (Crameri and Suter, Gene 137:69 (1993)) using 

the following oligonucleotides: 

Oligo 5: 
JUNBsiEII: 

5 '-CCTTCTTTAAcggtggttaccTGCTGGCAACCAACGTGGTTCATGAC-3 ' 
(SEQIDNO:5) 

Oligo 6: 

MhiUUN: 5 '-AAGCATGCTGCacgcgtgTGCGGTGGTCGGATCGCCCGGC-3 ' 
(SEQIDNO:6) 

[0265] For the PCR reaction, 100 pmol of each oligo was used with 5 ng of the 

template DNA in a 100 (il reaction mixture containing 4 units of Taq or Pwo 
polymerase, 0. 1 mM dNTPs and 1.5 mM MgS0 4 . All DNA concentrations were 
determined photometrically using the GeneQuant apparatus (Pharmacia). 
Polymerase was added directly before starting the PCR reaction (starting point 
was 95 °C). Temperature cycling was done in the following order and manner: 
95°C for 2 minutes; 5 cycles of 95°C (45 seconds), 60°C (30 seconds), 72°C 
(25 seconds); and 25 cycles of 95°C (45 seconds), 68°C (30 seconds), 72°C 
(20 seconds). 

[0266] The final PCR product was gel-purified and ligated into EcoRV digested 

and gel-purified pBluescript II(KS") . From the resulting vector, the JUN sequence 
was isolated by cleavage with Mlul/BstEll purified with QiaExII and ligated into 
vector pTE5'2JBM (previously cut with the same restriction enzymes) to obtain 
the vector pTE5^2J:E2/CW. 

EXAMPLE 2: 

Production of viral particles containing E2-JUN using the pCYTts system 
[0267] The structural proteins were PCR amplified using pTE5'2J:E2JUN as 

template and the oligonucleotides XbalStruct 

(ctatcaTCTAGAATGAATAGAGGATTCTTTAAC (SEQ ID NO: 12)) and 
StructBspl201 (tcgaatGGGCCCTCATCTTCGTGTGCTAGTCAG (SEQ ID 
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NO:87)). For the PGR 100 pmol of each loligo was used and 5 ng of the template 
DNA was used in the 100 |ul reaction mixture, containing 4 units of Tac or Pwo 
polymerase, 0. 1 mM dNTPs and 1.5 mM MgS0 4 . All DNA concentrations were 
determined photometrically using the GeneQuant apparatus (Pharmacia). The 
polymerase was added directly before starting the PCR reaction (starting point 
was 95 °C). The temperature cycles were as follows: 95 °C for 3 minutes, 
followed by 5 cycles of 92°C (30 seconds), 54°C (35 seconds), 72°C (270 
seconds) and followed by 25 cycles of 92° C (30 seconds), 63 °C (35 seconds), 
72 °C (270 seconds. The PCR product was gel purified and digested with the 
restriction enzymes Xbal/Bspl201 and ligated into vector pCYTts previously 
cleaved with the same enzymes (WO 99/50432) 

[0268] Twenty |ag of pCYTtsE2:JW were incubated with 30 U of Seal in an 

appropriate buffer for at least 4 hours at 37°C. The reaction was stopped by 
phenol/chloroform extraction, followed by an isopropanol precipitation of the 
linerized DNA. The restriction reaction was checked by agarose gel 
eletrophoresis. For the transfection, 5.4 \ig of linearized pCYTtsE2:J£W was 
mixed with 0.6 jig of linearized pSV2Neo in 30 jllI H 2 0 and 30 jil of 1 M CaCl 2 
solution were added. After addition of 60 jul phosphate buffer (50 mM HEPES, 
280 mM NaCl, 1.5 mM Na 2 HP0 4 , pH 7.05), the solution was vortexed for 5 
seconds, followed by an incubation at room temperature for 25 seconds. The 
solution was immediately added to 2 ml HP-1 medium containing 2% FCS (2% 
FCS medium). The medium of an 80% confluent BHK21 cell culture in a 6-well 
plate was then replaced with the DNA containing medium. After an incubation 
for 5 hours at 37° C in a C0 2 incubator, the DNA containing medium was 
removed and replaced by 2 ml of 1 5% glycerol in 2% FCS medium. The glycerol 
containing medium was removed after a 3 0 second incubation phase, and the cells 
were washed by rinsing with 5 ml of HP-1 medium containing 10% FCS. Finally 
2 ml of fresh HP-1 medium containing 10% FCS was added. 

[0269] Stably transfected cells were selected and grown in selection medium 

(HP- 1 medium, supplemented with G4 1 8) at 3 7 ° C in a CQ 2 incubator. When the 



WO 01/85208 



PCT/IB01/00741 



-73- 

mixed population was grown to confluency, the culture was split to two dishes, 
followed by a 12 hours growth period at 37 °C. One dish of the cells was shifted 
to 30 °C to induce the expression of the viral particles; the other dish was kept at 
37°C. 

[0270] The expression of viral particles was determined by Western blotting 

(Figure 1). Culture medium (0.5 ml) was methanol/chloroform precipitated, and 
the pellet was resuspended in SDS-PAGE sample buffer. Samples were heated 
for 5 minutes at 95 °C before being applied to 15% acrylamide gel. After 
SDS-PAGE, proteins were transferred to Protan nitrocellulose membranes 
(Schleicher & Schuell, Germany) as described by Bass and Yang, in Creighton, 
T.E., ed., Protein Function: A Practical Approach, 2nd Edn., IRL Press, Oxford 
(1997), pp. 29-55. The membrane was blocked with 1% bovine albumin (Sigma) 
in TBS (lOxTBS per liter: 87.7 gNaCl, 66. Ig Trizma hydrochloride (Sigma) and 
9.7 g Trizma base (Sigma), pH 7.4) for 1 hour at room temperature, followed by 
an incubation with an anti-El/E2antibody (polyclonal serum) for 1 hour.- The blot 
was washed 3 times for 10 minutes with TBS-T (TBS with 0.05% Tween20), and 
incubated for 1 hour with an alkaline-phosphatase-anti-rabbit IgG conjugate (0. 1 
|ig/ml, Amersham Life Science, England). After washing 2 times for 10 minutes 
with TBS-T and 2 times for 10 minutes with TBS, the development reaction was 
carried out using alkaline phosphatase detection reagents (10 ml AP buffer (100 
mM Tris/HCl, 100 mMNaCl, pH 9.5) with 50 ^1 NET solution (7.7% Nitro Blue 
Tetrazolium (Sigma) in 70% dimethylformamide) and 37 (xl of X-Phosphate 
solution (5% of 5-bromo-4-chloro-3-indolyl phosphate in dimethylformamide). . 

[0271] The production of viral particles is shown in Figure 1 . The Western Blot 

pattern showed that E2-JUN (lane 1) migrated to a higher molecular weight in 
SDS-PAGE compared to wild type E2 (lane 2) and the BHK21 host cell line did 
not show any background. 
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EXAMPLE 3 : 
Production of viral particles containing E2-JUN 
using the pTE5 '2JE2 : JUN vector 
[0272] RNase-free vector (1.0 ng) was linerarized by Pvul digestion. 

Subsequently, in vitro transcription was carried out using an SP6 in vitro 
transcription kit (InvitroscripCAP by InvitroGen, Invitrogen BV, NV Leek, 
Netherlands). The resulting 5 '-capped mRNA was analyzed on a reducing 
agarose-gel. 

[0273] In vitro transcribed mRNA (5 |ig) was electroporated into BHK 21 cells 

(ATCC: CCL10) according to Invitro gen's manual (Sindbis Expression system, 
Invitrogen BV, Netherlands). After 10 hours incubation at 37 °C, the FCS 
containing medium was exchanged by HP- 1 medium without FCS, followed by an 
additional incubation at 37°C for 10 hours. The supernatant was harvested and 
analyzed by Western blot analysis for production of viral particles exactly as 
described in Example 2. 

[0274] The obtained result was identical to the one obtained with pC YTtsE2 ; JUN 

as shown in Figure 2. 

EXAMPLE 4: 

Fusion of human growth hormone (hGH) to the FOS leucine 
zipper domain (OmpA signal sequence) 

[0275] The hGH gene without the human leader sequence was amplified from the 

original plasmid (ATCC 3 1389) by PGR. Oligo 7 with an internal Xbal site was 
designed for annealing at the 5 ' end of the hGH gene, and oligo 9 with an internal 
EcoRI site primed at the 3' end of the hGH gene. For the PGR reaction, 100 
pmol of each oligo and 5 ng of the template DNA was used in the 75 j^l reaction 
mixture (4 units of Taq or Pwo polymerase, 0. 1 mM dNTPs and 1 . 5 mM MgS0 4 ). 

[0276] PGR cycling was performed in the following manner: 30 cycles with an 

annealing temperature of 60 °C and an elongation time of 1 minute at 72 °C. 

[0277] The gel purified and isolated PGR product was used as a template for a 

second PCR reaction to introduce the ompA signal sequence and the 
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Shine-Dalgarno sequence. For the PGR reaction, 100 pmol of oligo 8 and 9 and 
1 ng of the template PCR fragment was used in the 75 \xl reaction mixture (4 units 
of Taq or Pwo polymerase, 0.1 mM dNTPs and 1.5 mMMgS0 4 ). The annealing 
temperature for the first five cycles was 55 °C with an elongation time of 60 
seconds at 72 ° C; another 25 cycles were performed with an annealing temperature 
of 65 °C and an elongation time of 60 seconds at 72°C. 
[0278] Oligo? : gggtctagattcccaaccattcccttatccaggctttttgac aacgctatgctccgcgccc 

atcgtctgcaccagctggcctttgacacc (SEQ ID NO: 7); oligo 8: gggtctagaaggaggtaaaaaa 
cgatgaaaaagacagctatcgcgattgcagtggcactggctggtttcgctaccgtagcgcaggccttcccaac 
cattcccttatcc (SEQ ID NO: 8); oligo 9: cccgaattcctagaagccacagctgccctcc (SEQ ID 
NO:9). 

[0279] The resulting recombinant hGH gene was subcloned into pBluescript via 

Xbal/EcoRL The correct sequence of both strands was confirmed by DNA 
sequencing. 

[0280] The DNA sequence coding for the FOS amphiphatic helix domain was 

PCR-amplified from vector pJuFo (Crameri & Suter Gene 137:69 (1993)) using 
the oligonucleotides: 
omp-FOS: 

5'- ccTGCGGTGGTCTGACCGACACCC-3 ' (SEQ ID NO: 10) 
FOS-hgh: 

5'- ccgcggaagagccaccGCAACCACCGTGTGCCGCCAGGATG-3' (SEQ ID 
NO: 11) 

[0281] For the PCR reaction, 100 pmol of each oligo and 5 ng of the template 

DNA was used in the 75 jlxI reaction mixture (4 units of Taq or Pwo polymerase, 
0.1 mM dNTPs and 1.5 mM MgS0 4 ). The temperature cycles were as follows: 

[0282] 95 °C for 2 minutes, followed by 5 cycles of 95 °C (45 seconds), 60 °C 

(30 seconds), 72 °C (25 seconds) and followed by 25 cycles of 95 °C (45 seconds), 
68°C (30 seconds), 72°C (20 seconds). 
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[0283] The PGR product was purified, isolated and cloned into the StuI digested 

pBluescript-ompA-hGH. The hybrid gene was then cloned into the pKK223-3 
Plasmid (Pharmacia). 

EXAMPLE 5: 
Bacterial expression of FOS-hGH 

[0284] The ompA-FOS-hGH in pkk223 -3 was expressed under the control of the 

inducible IPTG-dependend promoter using JM101 as E, coli host strain. 
Expression was performed in shaker flask. Cells were induced with 1 raM IPTG 
(final concentration) at an OD600 of 0.5. Expression was continued for 10 hours 
at37°C. Cellswereharvestedby centrifugationat 3600 at 10°Cfor 15min. The 
cell pellet was frozen (-20 °C or liq. N 2 ) and stored for 16 hours. The pellet was 
then thawed at 4 ° C and resuspended in 1 0 ml 1 0 niM Tris-HCl, pH 7.4 containing 
600 raM sucrose. After stirring for 15 min at 4°C, periplasmic proteins were 
released by an osmotic shock procedure. Chilled (4 ° C) deionized H 2 0 was added, 
and the suspension was stirred for 30 min at 4°C. The sludge was diluted, 
resuspended, and lysozyme was added to degrade the cell wall of the bacteria. 
The cells and the periplasmic fraction spheroplasts were separated by 
centrifugation for 20 min at 11000 x g at 4°C. The FOS-hGH-containing 
supernatant was analyzed by reducing and non-reducing SDS-Page and Dot Blot. 
Dot Blot was carried out as described in Example 8, using an anti-hGH antibody 
(Sigma) as the first antibody and an alkaline phosphatase (AP)-anti-mouse 
antibody conjugate as the second antibody. 

[0285] Full length, correctly processed FOS-hGH. could be detected under 

reducing and non-reducing conditions. Part of FOS-hGH was bound to other, 
non-identified proteins due to the free cysteines present in the FOS amphiphatic 
helix. However, more than 50% of expressed FOS-hGH occurred in its native 
monomeric conformation ( Figure 3). 

[0286] Purified FOS-hGH will be used to perform first doping experiments with 

JUN containing viral particles. 
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EX AMPLE 6: 

Construction of the pAV vector series for expression of FOS fUsion proteins 
[0287] A versatile vector system was constructed that allowed either cytplasmic 

production or secretion of N- or C-terminal FOS fusion proteins in E. coli or 
production of N- or C-terminal FOS fusion proteins in eukaryotic cells. The 
vectors pAVl - pAV4 which was designed for production of FOS fusion proteins 
in E. coli, encompasses the DNA cassettes listed below, which contain the 
following genetic elements arranged in different orders: (a) a strong ribosome 
binding site and 5" -untranslated region derived from the E. coli ompA gene 
(aggaggtaaaaaacg) (SEQ ID NO: 13); (b) a sequence encoding the signal peptide 
of E. coli outer membrane protein OmpA (MKKTAIAIAVALAGFATVAQA) 
(SEQ ID NO : 14); (c) a sequence coding for the FOS dimerization domain flanked 
on both sides by two glycine residues and a cystein residue 
(CGGLTDTLQAETDQVEDEKSALQTEIANLLKEKEKLEFILAAHGGC) 
(SEQ ID NO: 15); and (d) a region encoding a short peptidic linker (AAASGG 
(SEQ ID NO: 16) or GGSAAA (SEQ ID NO: 17)) connecting the protein of 
interest to the FOS dimerization domain. Relevant coding regions are given in 
upper case letters. The arrangement of restriction cleavage sites allows easy 
construction of FOS fusion genes with or without a signal sequence. The 
cassettes are cloned into the EcoRI/Hindlll restriction sites of expression vector 
pKK223-3 (Pharmacia) for expression of the fusion genes under control of the 
strong tac promotor. 

pAVl 

[0288] This vector was designed for the secretion of fusion proteins with FOS at 

the C-terminus into the E. coli periplasmic space. The gene of interest (g.o.i.) 
may be ligated into the StuI/NotI sites of the vector. 

EcoRI 31/11 

craa ttc agg agg taa aaa acg ATG AAA AAG ACA GCT ATC GCG ATT GCA 
GTG GCA CTG GCT 
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MKKTAI A I A 

V A L A 

61/21 StuI NotI 

GGT TTC GCT ACC GTA GCG C AG GCC t qcr gtg ggg GCG GCC GC T TCT GGT 
GGT TGC GGT GGT 

GFATVAQA (goi) A A A S G 

G C G G 

121/41 151/51 

CTG ACC GAC ACC CTG CAG GCG GAA ACC GAC CAG GTG GAA GAC GAA AAA 
TCC GCG CTG CAA 

LTDTL QAETDQVE DEK 
S A L Q 

181/61 211/71 

ACC GAA ATC GCG AAC CTG CTG AAA GAA AAA GAA AAG CTG GAG TTC ATC 
CTG GCG GCA CAC 

TEIANLLKEKEKLEFI 
L A A H 

241/81 Hindlll 

GGT GGT TGC t aa crct t (SEQIDNO:18) 

g g c * a (SEQ ID NOs: 14 and 19) 

pAV2 

[0289] This vector was designed for the secretion of fusion proteins with FOS at 

the N-terminus into the E. coli periplasmic space. The gene of interest (g.o.i.) 
ligated into the Notl/EcoRV (or Notl/Hindlll) sites of the vector. 



EcoRI 31/11 

qaa ttc agg agg taa aaa acg ATG AAA AAG ACA GCT ATC GCG ATT GCA 
GTG GCA CTG GCT 

MKKTAIAIA 

V A L A 

61/21 StuI 91/31 

GGT TTC GCT ACC GTA GCG C AG GCC T GC GGT GGT CTG ACC GAC ACC CTG 
CAG GCG GAA ACC 

GFATVAQACGGLTDTL 
Q A E T 
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121/41 . 151/51 

GAC CAG GTG GAA GAC GAA AAA TCC GCG CTG CAA ACC GAA ATC GCG AAC 
CTG CTG AAA GAA 

D QVEDEKSALQTEIAN 
L L K E 

181/61 211/71 
Not I 

AAA GAA AAG CTG GAG TTC ATC CTG GCG GCA CAC GGT GGT TGC GGT GGT 
TCT GCG GCC GC T 

KEKLEFI LAAHGGCGG 
S A A A 

241/81 EcoRV Hind.HI 

ggg tgt ggg gat ate aag ctt (SEQ ID NO: 20) 

(goi) (SEQIDNO:21) 
PAV3 

[0290] This vector was designed for the cytoplasmic production of fusion proteins 

withFOS at the C-terminus inE. coli. The gene of interest (g.oi.) may be ligated 
into the EcoRV/NotI sites of the vector. 

EcoRI EcoRV NotI 

qaa ttc agg agg taa aaa gat ate ggg tgt ggg GCG GCC GC T TCT GGT 
GGT TGC GGT GGT 

(goi) A A A S G 

G C G G 

61/21 91/31 

CTG ACC GAC ACC CTG CAG GCG GAA ACC GAC CAG GTG GAA GAC GAA AAA 
TCC GCG CTG CAA 

L> TD TLQAE TDQVEDEK 
S A L Q 

121/41 151/51 

ACC GAA ATC GCG AAC CTG CTG AAA GAA AAA GAA AAG CTG GAG TTC ATC 
CTG GCG GCA CAC 

T E I ANLLKEKEKLE F I 
L A A H 



181/61 Hindi I I 

GGT GGT TGC t aa get t (SEQ ED NO: 22) 

g g c * (SEQIDNO:23) 
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pAV4 

[0291] This vector is designed for the cytoplasmic production of fusion proteins 

withi^OS at the N-terminus ini£ coli. The gene of interest (g.o.i.) may be ligated 
into the Notl/EcoRV (or Notl/Hindlll) sites of the vector. The N-terminal 
methionine residue is proteolytically removed upon protein synthesis (Hirel etal, 
Proc. Natl Acad. Sci. USA 55:8247-8251 (1989)). 

EcoRI 31/11 

qaa ttc agg agg taa aaa a eg ATG GCT TGC GGT GGT CTG ACC GAC ACC 
CTG CAG GCG GAA 

EFRR * KTMAC GGLTDT 
L Q A E 

61/21 91/31 

ACC GAC CAG GTG GAA GAC GAA AAA TCC GCG CTG CAA ACC GAA ATC GCG 
AAC CTG CTG AAA 

TDQVEDEKSALQTEIA 
N L L K 

121/41 151/51 
Not I 

GAA AAA GAA AAG CTG GAG TTC ATC CTG GCG GCA CAC GGT GGT TGC GGT 
GGT TCT GCG GCC 

E KE KLEFILAAHGGCG 
G S A A 

181/61 EcoRV Hindi I I 

GC T ggg tgt ggg cyat ate aaq ctt (SEQ ID NO. 24) 

a (goi) (SEQ ID NOs:88 and 25) 

[0292] The vectors pAV5 and pAV6, which are designed for eukaryotic 

production of FOS fusion proteins, encompasses the following genetic elements 
arranged in different orders: (a) a region coding for the leader peptide of human 
growth hormone (MATGSRTSLLLAFGLLCLPWLQEGSA) (SEQ ID NO:26); 
(b) a sequence coding for the FOS dimerization domain flanked on both sides by 
two glycine residues and a cysteine residue 

(CGGLTDTLQAETDQVEDEKSALQTEIANLLKEKEKLEFILAAHGGC) 
(SEQ ID NO: 15); and 
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(c) a region encoding a short peptidic linker (AAASGG (SEQ ID NO: 16) or 
GGSAAA (SEQ ID NO: 17)) connecting the protein of interest to the FOS 
dimerization domain. Relevant coding regions are given in upper case letters. 
The arrangement of restriction cleavage sites allows easy construction of FOS 
fusion genes. The cassettes are cloned into the EcoRI/Hindlll restriction sites of 
the expression vector pMPSVEH (Artelt etaL, Gene 68:213-219 (1988)). 



pAV5 

[0293] This vector is designed for the eukaryotic production of fusion proteins 

withFOS at the C-terminus. The gene of interest (g.o.i.) may be inserted between 
the sequences coding for the hGH signal sequence and the FOS domain by ligation 
into the Eco47III/NotI sites of the vector. Alternatively, a gene containing its 
own signal sequence may be fused to the FOS coding region by ligation into the 
StuI/NotI sites. 



EcoRI StuI 31/11 

craa ttc aaa cct ATG GCT ACA GGC TCC CGG ACG TCC CTG CTC CTG GCT 
TTT GGC CTG CTC 

MATGSRTS LLLA 

F G L L 

61/21 Eco47III Not I 



TGC CTG CCC TGG CTT CAA GAG GGC 
TCT GGT GGT TGC 

CLPWLQEG 
S G G C 

121/41 



AGC GCT ggg tgt ggg GCG GCC GC T 
S A (goi) AAA 

151/51 



GGT GGT CTG ACC GAC ACC CTG CAG 
GAA AAA TCC GCG 

GGLTDTLQ 
E K S A 

181/61 



GCG GAA ACC GAC CAG GTG GAA GAC 
AETDQVED 

211/71 



CTG CAA ACC GAA ATC GCG AAC CTG CTG AAA GAA AAA GAA AAG CTG GAG 
TTC ATC CTG GCG 

LQT E IANLLKEKE KLE 
F I L A 
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241/81 Hindlll 

GCA CAC GGT GGT TGC t aa get t (SEQ ID NO.27) 

a h g g c (SEQIDNO:28) 
pAV6 

[0294] This vector is designed for the eukaryotic production of fusion proteins 

with FOS at the N-terminus. The gene of interest (g.o.i.) may be ligated into the 
Notl/StuI (or Notl/Hindlll) sites of the vector. 

EcoRI 31/11 

qaa ttc ATG GCT ACA GGC TCC CGG ACG TCC CTG CTC CTG GCT TTT GGC 
CTG CTC TGC CTG 

MATGS RTSLLLAFG 
L L C L 

61/21 EC047III 91/31 

CCC TGG CTT CAA GAG GGC AGC GCT TGC GGT GGT CTG ACC GAC ACC CTG 
CAG GCG GAA ACC 

PWLQEGSACGGLTDTL 
Q A E T 

121/41 * 151/51 

GAC CAG GTG GAA GAC GAA AAA TCC GCG CTG CAA ACC GAA ATC GCG AAC 
CTG CTG AAA GAA 

DQVEDEK SALQTEX AN 
L L K E 

181/61 211/71 
NotI 

AAA GAA AAG CTG GAG TTC ATC CTG GCG GCA CAC GGT GGT TGC GGT GGT 
TCT GCG GCC GC T 

KEKLEFI LAAHGGCGG 
S A A A 



241/81 StuI Hindlll 

ggg tgt ggg acra cct aag ctt 
(goi) 



(SEQIDNO:29) 
(SEQIDNO:30) 
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Construction of expression vectors pAVl - pAV6 

[0295] The following oligonucleotides have been synthesized for construction of 

expression vectors pAVl - pAV6: 
FOS-FORl: 

CCTGGGTGGGGGCGGCCGCTTCTGGTGGTTGCGGTGGTCTGACC(SEQ 

IDNO:31); 

FOS-FOR2: 

GGT GGG A ATTC AGGAGGT AA A A AGAT ATCGGGT GTGGGGC GGC C 

(SEQ ID NO . 3 2); 

FOS-FOR3: 

GGTGGGAATTCAGGAGGTAAAAAACGATGGCTTGCGGTGGTCTGACC 

(SEQIDNO:33); 

FOS-FOR4: 

GCTTGCGGTGGTCTGACC (SEQ ID NO:34); 
FOS-KEV1: 

CCACCAAGCTTAGCAACCACCGTGTGC (SEQ ID NO:35); 
FOS-KEV2: 

CCACCAAGCTTGATATCCCCACACCCAGCGGCCGCAGAACCACCGC 

AACCACCG (SEQ ID NO:36); 

FOS'-REV3: 

CCACCAAGCTTAGGCCTCCCACACCCAGCGGC (SEQ IDNO:37); 
OmpA-FORl: 

GGTGGGAATTCAGGAGGTAAAAAACGATG (SEQ ID NO:38); 
hGH-FORl: 

GGTGGGAATTCAGGCCTATGGCTACAGGCTCC (SEQ ID NO:39); and 
hGH-FOR2. 

GGTGGGAATTCATGGCTACAGGCTCCC (SEQ ID NO:40). 
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[0296] For the construction of vector pAV2, the regions coding for the OmpA 

signal sequence and the FOS domain were amplified from the omp A-FOS-hGTA 
fusion gene in vector pKK223-3 (see Example 5) using the primer pair 
OmpA-FORl/ FOS-KEV2. The PCR product was digested with EcoRI/Hindlll 
and ligated into the same sites of vector pKK223-3 (Pharmacia). 

[0297] For the construction of vector pAVl, the FOS coding region was 

amplified from the ompA-FO/S-hGH fusion gene in vector pKK223-3 (see 
Example 5) using the primer pair FO.S'-FORl/FO^-REVl. The PCR product was 
digested with Hindlll and ligated into Stul/Hindlll digested vector pAV2. 

[0298] For the construction of vector pAV3, the region coding for the FOS 

domain was amplified from vector pAVl using the primer pair 
FOS-FORl/FOS-KEV 1 . The PCR product was digested with EcoRMIindlll and 
ligated into the same sites of the vector pKK223-3 (Pharmacia). 

[0299] For the construction of vector pAV4 3 the region coding for the FOS 

domain was amplified from the omp A-FOS-hGH fusion gene in vector pKK223 -3 
(see Example 5) using the primer pair FOS-FOR3/FOS-KEV2. The PCR product 
was digested with EcoRI/Hindlll and ligated into the same sites of the vector 
pKK223-3 (Pharmacia). 

[0300] For the construction of vector p AV5 5 the region coding for the hGH signal 

sequence is amplified from the hGH-FOS-hGH fusion gene in vector pSINrep5 
(see Example 7) using the primer pair hGH-F OR 1 /hGHRE V 1 . The PCR product 
is digested with EcoRI/NotI and ligated into the same sites of the vector pAVl . 
The resulting cassette encoding the hGH signal sequence and the FOS domain is 
then isolated by EcoRI/Hindlll digestion and cloned into vector pMPSVEH 
(Artelt et al, Gene (55:213-219 (1988)) digested with the same enzymes. 

[0301] For the construction of vector pAV6 5 the FOS coding region is amplified 

from vector pAV2 using the primer pair FOS-FOR4/FOSKEV3 . The PCR 
product is digested withHindlll and cloned into Eco47III/HindIII cleaved vector 
pAV5. The entire cassette encoding the hGH signal sequence and the FOS 
domain is then reamplified from the resulting vector using the primer pair 
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hGH-FOR2/FOSREV3, cleaved with EcoRI/Hindlll and ligated into vector 
pMPSVEH (Artelt et al, Gene 68:213-219 (1988)) cleaved with the same 
enzymes. 

EXAMPLE 7: 

Construction of FOS-hGH with human (hGH) signal sequence 
[0.302] For eukaryotic expression of the FOS-hGH fusion protein, the 

OmpA-i^Ctf-hGH fusion gene was isolated from pBluescript::OmpA-F0S-hGH 
(see Example 4) by digestion with Xbal/Bsp 1 201 and cloned into vector pSINrepS 
(Invitrogen) cleaved with the same enzymes. The hGH signal sequence was 
synthesized by PCR (reaction mix: 50 pmol of each primer, dATP, dGTP, dTTP, 
dCTP (200 |iM each), 2.5 U Taq DNA polymerase (Qiagen), 50 |il total volume 
in the buffer supplied by the manufacturer; amplification: 92 °C for 30 seconds, 
55 °C for 30 seconds, 72 °C for 30 seconds, 30 cycles) using the overlapping 
oligonucleotides Sig-hGH-FOR 

(GGGTCTAGAATGGCTACAGGCTCCCGGACGTCCCTGCTCCTGGCTT 
TTGGCCTGCTCTG) (SEQ ID NO:41) and Sig-hGH-REV 
(CGCAGGCCTCGGCACTGCCCTCTTGAAGCCAGGGCAGGCAGAGCA 
GGCCAAAAGCCAG) (SEQ ID NO:42). The PCR product was purified using 
the QiaExII Kit, digested with Stul/Xbal and ligated into vector 
p SINrep5 : : Omp A-^Ctf-hGH cleaved with the same enzymes. 

EXAMPLE 8: 
Eukaryotic expression of FOS-hGH 
[0303] RNase-free vector (1.0 |ug) (pSINrep5::OmpA-F0S-hGH) and 1.0 \ig of 

DHEB (Bredenbeek et al, J. Virol 67:6439-6446 (1993)) were linerarized by 
Seal restriction digest. Subsequently, in vitro transcription was carried out using 
an SP6 in vitro transcription kit (InvitroscripCAP by InvitroGen, Invitrogen BV, 
NV Leek, Netherlands). The resulting 5 '-capped mRNA was analyzed on 
reducing agarose-gel. 
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[0304] In vitro, transcribed mRNA 5 |Lig was electroporated into BHK 21 cells 

(ATCC: CCL10) according to Invitrogen's manual (Sindbis Expression system, 
Invitrogen BV, Netherlands). After 10 hours incubation at 37° C the FCS 
containing medium was exchanged by HP- 1 medium without FCS, followed by an 
additional incubation at 37 °C for 10 hours. The supernatant was harvested and 
analyzed by dot-blot analysis for production of FOS-hgh. 

[0305] Culture media (2.5 jliI) was spotted on a nitrocellulose membrane and dried 

for 1 0 minutes at room temperature. The membrane was blocked with 1 % bovine 
albumin (Sigma) in TBS (lOxTBS per liter: 87.7 g NaCl,. 66.1g Trizma 
hydrochloride (Sigma) and 9.7 g Trizma base (Sigma), pH 7.4) for 1 hour at room 
temperature, followed by an incubation with 2 jig rabbit anti-human hGH antibody 
(Sigma) in 10 ml TBS-T (TBS with 0.05% Tween20) for 1 hour. The blot was 
washed 3 times for 10 minutes with TBS-T and incubated for 1 hour with alkaline 
phosphatase conjugated anti-rabbit IgG (Jackson ImmunoResearch Laboratories, 
Inc.) diluted 1 :5000 in TBS-T. After washing 2 times for 10 minutes with TBS-T 
and 2 times for 10 minutes with TBS, the blot was developed by AP staining as 
described in Example 2. Results are shown in Figure 3. 

EXAMPLE 9: 
Construction of FOS-PLA (N- and C-terminal) 

[0306] The following gene is constructed by chemical gene synthesis coding for 

a catalytically inactive variant (Forster et al, J. Allergy Clin. Immunol 95: 
1229-1235 (1995)) of bee venom phospholipase A 2 (PLA). 

1/1 31/11 

ATC ATC TAC CCA GGT ACT CTG TGG TGT GGT CAC GGC AAC AAA TCT TCT 
GGT CCG AAC GAA 

IIYPGTLWCGHGNKSS 
G P N E 

61/21 91/31 

CTC GGC CGC TTT AAA CAC ACC GAC GCA TGC TGT CGC ACC CAG GAC ATG 
TGT CCG GAC GTC 

LGRFKHTDACCRTQDM 
C .P D V 
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121/41 151/51 

ATG TCT GCT GGT GAA TCT AAA CAC GGG TTA ACT AAC ACC GCT TCT CAC 
ACG CGT CTC AGC 

MSAGE SKHGLTNTASH 
T R L S 

181/61 211/71 



TGC GAC TGC GAC GAC AAA TTC TAC 
ACC ATC TCT TCT 

CDCDDKF Y 
T I S S 

241/81 

TAC TTC GTT GGT AAA ATG TAT TTC 
AAA CTG GAA CAC 

YFVGKMYF 
K L E H 

301/101 ' 



GAC TGC CTT AAG AAC TCC GCC GAT 
DCIjKNSAD 

271/91 

AAC CTG ATC GAT ACC AAA TGT TAC 
NLIDTKCY 

331/111 



CCG GTA ACC GGC TGC GGC GAA CGT ACC GAA GGT CGC TGC CTG CAC TAC 
ACC GTT GAC AAA 

PVTGCGERTEGRCLHY 
T V D K 

361/121 391/131 



TCT AAA CCG AAA GTT TAC CAG TGG TTC GAC CTG CGC AAA TAC (SEQ 

ID NO:43) 

SKPKVYQ WFDLRKY (SEQ 

IDNO.44) 



[0307] For fusion of PL A to the N-terminus of the FOS dimerization domain, the 

region is amplified using the oligonucleotides PLA-FOR1 
(CCATCATCTACCCAGGTAC) (SEQ ID NO:45) and PLA-REV1 
(CCCACACCCAGCGGCCGCGTATTTGCGCAGGTCG) (SEQ ID NO:46). 
The PCR product is cleaved with NotI and ligated into vector pAVl previously 
cleaved with the restriction enzymes Stul/Notl. For fusion of PL A to the 
C-terminus of the FOS dimerization domain, the region is amplified using the 
oligonucleotides PLA-FOR2 

(CGGTGGTTCTGCGGCCGCTATCATCTACCCAGGTAC) (SEQ IDNO.47) 
and PLA-REV2 (TTAGTATTTGCGCAGGTCG) (SEQ ID NO:48). The PCR 
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product is cleaved withNotl and ligated into vector pAV2 previously cleaved with 
the restriction enzymes Notl/EcoRV. 

EXAMPLE 10: 

Construction of FOS-Ovalbumin fusion gene (N- and C-terminal) 
[0308] For cloning of the ovalbumin coding sequence, mRNA from chicken 

oviduct tissue is prepared using the QuickPrep™ Micro mRNA Purification Kit 
(Pharmacia) according to manufacturer instructions. Using the Superscript™ 
One-step RT PCR Kit (Gibco BRL), a cDNA encoding the mature part of 
ovalbumin (corresponding to nucleotides 68-1222 of the mRNA (McReynolds et 
al, Nature 273:723-728 (1978)) is synthesized using the primers Ova-FORl 
(CCGGCTCCATCGGTGCAG) (SEQ ID NO .49) and Ova-REVl 
(ACCACCAGAAGCGGCCGCAGGGGAAACACATCTGCC) (SEQ IDNO:50). 
The PCR product is digested withNotl and cloned into StuI/NotI digested vector 
pAVl for expression of the fusion protein with the FOS dimerization domain at 
the C terminus. For production of a fusion protein with the FOS dimerization 
domain at the N terminus, the Ovalbumin coding region is amplified from the 
constructed vector (pAVl::Ova) using the primers Ova-FOR2 
(CGGTGGTTCTGCGGCCGCTGGCTCCATCGGTGCAG) (SEQ ID NO:51) 
and Ova-REV2 (TTAAGGGGAAACACATCTGCC) (SEQ ID NO:52). The 
PCR product is digested with NotI and cloned into the Notl/EcoRV digested 
vector pAV2. Cloned fragments are verified by DNA sequence analysis. 

EXAMPLE 11 
Production and purification of FOS-PLA and 
FOS ovalbumin fusion proteins 
[0309] For cytoplasmic production of FOS fusion proteins, an appropriate E. coli 

strain was transformed with the vectors p AV3 : :PL A, p AV4 : :PLA, p AV3 : : Ova or 
pAV4::Ova. The culture was incubated in rich medium in the presence of 
ampicillin at 37°C with shaking. At an optical density (550nm) of 1, 1 mMIPTG 
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was added and incubation was continued for another 5 hours. The cells were 
harvested by centrifugation, resuspended in an appropriate buffer (e.g., tris-HCl, 
pH 7.2, 150 mMNaCl) containing DNase, RNase and lysozyme, and disrupted 
by passage through a french pressure cell. After centrifugation (Sorvall RC-5C, 
SS34 rotor, 15000 rpm, 10 min, 4°C), the pellet was resuspended in 25 ml 
inclusion body wash buffer (20 mM tris-HCl, 23% sucrose, 0.5% Triton X-100, 
1 mMEDTA, pH8) at 4°C and recentrifuged as described above. This procedure 
was repeated until the supernatant after centrifugation was essentially clear. 
Inclusion bodies were resuspended in 20 ml solubilization buffer (5.5 M 
guanidinium hydrochloride, 25 mM tris-HCl, pH 7.5) at room temperature and 
insoluble material was removed by centrifugation and subsequent passage of the 
supernatant through a sterile filter (0.45 |im). The protein solution was kept at 

4 ° C for at least 1 0 hours in the presence of 1 0 mM EDT A and 1 00 mM DTT and 
then dialyzed three times against 1 0 volumes of 5 . 5 M guanidinium hydrochloride, 
25 mM tris-HCl, 10 mM EDT A, pH 6. The solution was dialyzed twice against 

5 liters of 2 M urea, 4 mM EDTA, 0. 1 MNH 4 C1, 20 mM sodium borate (pH 8.3) 
in the presence of an appropriate redox shuffle (oxidized glutathione/reduced 
glutathione; cystine/cysteine). The refolded protein was then applied to an ion 
exchange chromatography. The protein was stored in an appropriate buffer with 
a pH above 7 in the presence of 2-10 mM DTT to keep the cysteine residues 
flanking the FOS domain in a reduced form. Prior to coupling of the protein with 
the alphavirus particles, DTT was removed by passage of the protein solution 
through a Sephadex G-25 gel filtration column. 

EXAMPLE 12: 
Constructions of gpl40-FOS 
[0310] jThe gpl40 gene (Swiss-Prot:P03375) without the internal protease 

cleavage site was amplified by PGR from the original plasmid pAbT4674 (ATCC 
40829) containing the full length gp 1 60 gene using the following oligonucleotides : 
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HIV-1: 

5'-ACTAGTCTAGAatgagagtgaaggagaaatatc-3' (SEQ IDNO:53); 
HlV-end: 

S'-TAGCATGCTAGCACCGAAtttatctaattccaataattcttg-S' (SEQ ID NO:54); 
HIV-Cleav: 

5 ' -gtagcacccaccaaggcaaagCTGAAAGCT ACCC AGCTCGAGAAACTGgca-3 ' 

(SEQ ID NO: 5 5); and 

HIV-Cleav2: 

5 '-caaagctcctattcccactgcCAGTTTCTCGAGCTGGGT AGCTTTCAG-3 ' 
(SEQ ID NO: 56). 

[0311] For PCR I, 100 pmol of oligo HIV-1 and HIV~Cleav2 and 5 ng of the 

template DNA were used in the 75 jil reaction mixture (4 units of Taq or Pwo 
polymerase, 0. 1 mM dNTPs and 1.5 mM MgS0 4 ). PCR cycling was done in the 
following manner: 30 cycles with an annealing temperature of 60 °C and an 
elongation time of 2 minutes at 72 °C. 

[0312] For PCR II, 100 pmol of oligo HIV-end and HIV-Cleav and 5 ng of the 

template DNA were used in the 75 jul reaction mixture, (4 units of Taq or Pwo 
polymerase, 0. 1 mM dNTPs and 1.5 mM MgS0 4 ). PCR cycling was done in the 
following manner: 30 cycles with an annealing temperature of 60 °C and an 
elongation time of 50 seconds at 72 °C. 

[0313] Both PCR fragments were purified, isolated and used in an assembly PCR 

reaction. For the assembly PCR reaction, 100 pmol of oligo HIV-1 and HIV-end 
and 2 ng of each PCR fragment (PCRI and PCR II) were used in the 75 \il (4 units 
of Taq or Pwo polymerase, 0. 1 mM dNTPs and 1.5 mM MgS0 4 ). PCR cycling 
was done in the following manner: 30 cycles with an annealing temperature of 
60 ° C and an elongation time of 2 . 5 minutes at 72 ° C . The assembly PCR product 
was digested Xbal and Nhel. The FOS amphiphatic helix was fused in frame to 
the C-terminal end of gp-140. 
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[0314] The DNA sequence coding for the FOS amphiphatic helix domain was 

PCR-amplified from vector pJuFo (Crameri & Suter Gene 137:69 (1993)) using 
the oligonucleotides: 
FOS-BIV: 

5'-ttcggtgctagcggtggcTGCGGTGGTCTGACCGAC-3' (SEQ ID NO:57); and 
FOS-Apa: 

5'-gatgctgggcccttaaccGCAACCACCGTGTGCCGCC-3' (SEQ ID NO:58). 

[0315] For the PCR reaction, 100 pmol of each oligo and 5 ng of the template 

DNA was used in the 75 |il reaction mixture (4 units of Taq or Pwo polymerase, 
0.1 mM dNTPs and 1 . 5 mM MgS0 4 ). Temperature cycling was done as follows: 
95 °C for 2 minutes, followed by 5 cycles of 95 °C (45 seconds), 60°C (30 
seconds), 72 °C (25 seconds) and followed by 25 cycles of 95 °C (45 seconds), 
68 ° C (3 0 seconds), 72 ° C (20 seconds). The obtained PCR fragment was digested 
with Nhel and Bspl20L. 

[0316] The final expression vector for GP140-FOS was obtained in a 3 fragment 

ligation of both PCR fragments into pSinRepS. The resultant vector 
pSinRep5~GP 1 40-FOS was evaluated by restriction analysis and DNA sequencing. 

[0317] GP 140-FOS was also cloned into pCYTts via Xbal and Bsp 120L to obtain 

a stable, inducible GP 140-FOS expressing cell line. 

EXAMPLE 13: 
Expression of GP140FOS using pSinRep5-GP140FOS 
[0318] RNase-free vector (1.0 |ig)(pSinRep5-GP140-FOS)and 1.0 jug of DHEB 

(Bredenbeek et aL, J. Virol. 57:6439-6446 (1993)) were linearized by restriction 
digestion. Subsequently, in vitro transcription was carried out using an SP6 in 
vitro transcription kit (InvitroscripCAP by InvitroGen, Invitrogen B V, NV Leek, 
Netherlands). The resulting 5 '-capped mRNA was analyzed on a reducing 
agarose-gel. 

[0319] In vitro transcribed mRNA (5 |ug) was electroporated into BHK 21 cells 

(ATCC: CCL10) according to Invitrogen's manual (Sindbis Expression System, 
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Invitrogen BV, Netherlands). After 10 hours incubation at 37 °C, the FCS 
containing medium was exchanged by HP- 1 medium without FCS, followed by an 
additional incubation at 37°C for 10 hours. The supernatant was harvested and 
analyzed by Western blot analysis for production of soluble GP 140-FOS exactly 
as described in Example 2. 

EXAMPLE 14: 
Expression of GP140FOS using pCYTts-GP140FOS 
[0320] pCYT-GP 140-FOS 20 jig was linearized by restriction digestion. The 

reaction was stopped by phenol/chloroform extraction, followed by an isopropanol 
precipitation of the linearized DNA. The restriction digestion was evaluated by 
agarose gel eletrophoresis. For the transfection, 5.4 jig of linearized 
pCYTtsGP 140-FOS was mixed with 0.6 \ig of linearized pSV2Neo in 30 ^1 H 2 0 
and 30 pi of 1 M CaCl 2 solution was added. After addition of 60 \il phosphate 
buffer (50 mMHEPES, 280 mMNaCl, 1.5 mM Na 2 HP0 4 , pH7,05), the solution 
was vortexed for 5 seconds, followed by an incubation at room temperature for 
25 seconds. The solution was immediately added to 2 ml HP-1 medium 
containing 2% FCS (2% FCS medium). The medium of an 80% confluent 
BHK21 cell culture (6-well plate) was then replaced by the DNA containing 
medium. After an incubation for 5 hours at 37° C in a C0 2 incubator, the DNA 
containing medium was removed and replaced by 2 ml of 1 5% glycerol in 2% FCS 
medium. The glycerol containing medium was removed after a 30 second 
incubation phase, and the cells were washed by rinsing with 5 ml of HP-1 medium 
containing 10% FCS. Finally 2 ml of fresh HP-1 medium containing 10% FCS 
was added. 

[0321] Stably transfected cells were selected and grown in selection medium 

(HP-1 medium supplemented with G418) at 37° C in a C0 2 incubator. When the 
mixed population was grown to confluency, the culture was split to two dishes, 
followed by a 12 h growth period at 37°C. One dish of the cells was shifted to 
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30 °C to induce the expression of soluble GP14Q-FOS. The other dish was kept 
at37°C. 

[0322] The expression of soluble GP140-FOS was determined by Western blot 

analysis. Culture media (0.5 ml) was methanol/chloroform precipitated, and the 
pellet was resuspended in SDS-PAGE sample buffer. Samples were heated for 5 
minutes at 95 °C before being applied to a 15% acrylamidegel. After SDS-PAGE, 
proteins were transferred to Protan nitrocellulose membranes (Schleicher & 
Schuell, Germany) as described by Bass and Yang, in Creighton, T.E., ed., 
Protein Function: A Practical Approach, 2nd Edn., IRL Press, Oxford (1997), 
pp. 29-55. The membrane was blocked with 1 % bovine albumin (Sigma) in TBS 
(lOxTBS per liter: 87.7 g NaCl, 66. lg Trizma hydrochloride (Sigma) and 9.7 g 
Trizma base (Sigma), pH 7.4) for 1 hour at room temperature, followed by an 
incubation with an anti-GP140 or GP-160 antibody for 1 hour. The blot was 
washed 3 times for 10 minutes with TBS-T (TBS with' 0.05% Tween20), and 
incubated for 1 hour with an alkaline-phosphatase-anti- 
mouse/rabbit/monkey/human IgG conjugate. After washing 2 times for 10 
minutes with TBS-T and 2 times for 10 minutes with TBS, the development 
reaction was carried out using alkaline phosphatase detection reagents (10 ml AP 
buffer (1 00 mMTris/HCl, lOOmMNaCl, P H9.5)with50 jliINBT solution (7.7% 
Nitro Blue Tetrazolium (Sigma) in 70% dimethylformamide) and 37 \il of 
X-Phosphate solution (5% of 5-bromo-4-chloro-3-indolyl phosphate in 
dimethylformamide) . 

EXAMPLE 15: 
Production and purification of GP140FOS 
[0323] An anti-gp 1 20 antibody was covalently coupled to a NHS/EDC activated 

dextran and packed into a chromatography column. The supernatant, containing 
GP14QFOS is loaded onto the column and after sufficient washing, GP 140FOS 
was eluted using 0. 1 M HC1. The eluate was directly neutralized during collection 
using 1 M Tris pH 7.2 in the collection tubes. 
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[0324] Disulfide bond formation might occur during purification, therefore the 

collected sample is treated with 10 raM DTT in 10 raM Tris pH 7.5 for 2 hours 
at25°C. 

[0325] DTT is remove by subsequent dialysis against 10 mMMes; 80 mMNaCl 

pH 6.0. Finally GP140FOS is mixed with alphavirus particles containing the JUN 
leucine zipper in E2 as described in Example 16. 

EXAMPLE 16: 
Preparation of the Alpha Vaccine Particles 

[0326] Viral particles {see Examples 2 and 3) were concentrated using Millipore 

Ultrafree Centrifugal Filter Devices with a molecular weight cut-off of 100 kD 
according to the protocol supplied by the manufacturer. Alternatively, viral 
particles were concentrated by sucrose gradient centrifiigation as described in the 
instruction manual of the Sindbis Expression System (Invitrogen, San Diego, 
California). The pH of the virus suspension was adjusted to 7. 5 and viral particles 
were incubated in the presence of 2- 1 0 mM DTT for several hours. Viral particles 
were purified from contaminating protein on a Sephacryl S-300 column 
(Pharmacia) (viral particles elute with the void volume) in an appropriate buffer. 

[0327] Purified virus particles were incubated with at least 240 fold molar excess 

of F<9£-antigen fusion protein in an appropriate buffer (pH 7.5-8.5) in the 
presence of a redox shuffle (oxidized glutathione/reduced glutathione; 
cystine/cysteine) for at least 1 0 hours at 4 ° C. After concentration of the particles 
using a Millipore Ultrafree Centrifugal Filter Device with a molecular weight 
cut-off of 100 kD, the mixture was passed through a Sephacryl S-300 gel filtration 
column (Pharmacia). Viral particles were eluted with the void volume. 
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EXAMPLE 17: 

Fusion of JUN amphipathic helix to the amino terminus of HBcAg(l-144) 
[0328] The JUN helix was fused to the amino terminus of the HBcAg amino acid 

sequence 1 to 144 (JUN-HBcAg construct). For construction of the JUN-HBcAg 
DNA sequence, the sequences encoding the JUN helix and HBcAg(l -144) were 
amplified separately by PGR. The JUN sequence was amplified from the pJuFo 
plasmid using primers EcoRI-JUN(s) and JUN-SacII(as). The EcoRI-JUN(s) 
primer introduced an EcoRI site followed by a start ATG codon. The JUN- 
SacII(as) primer introduced a linker encoding the amino acid sequence GAAGS. 
The HBcAg (1-144) sequence was amplified from the pEco63 plasmid (obtained 
from ATCC No. 31518) using primers JUN-HBcAg(s) and 
HBcAg(l-144)Hind(as). JUN-HBcAg(s) contained a sequence corresponding to 
the 3' end of the sequence encoding the JUN helix followed by a sequence 
encoding the GAAGS linker and the 5' end of the HBcAg sequence. HBcAg(l- 
144)Hind(as) introduces a stop codon and a Hindlll site after codon 144 of the 
HBcAg gene. For the PGR reactions, 100 pmol of each oligo and 50 ng of the 
template DNAs were used in the 50 )il reaction mixtures with 2 units of Pwo 
polymerase, 0. 1 mM dNTPs and 2 mM MgS0 4 . For both reactions, temperature 
cycling was carried out as follows: 94°C for 2 minutes; and 30 cycles of 94°C (1 
minute), 50°C (1 minute), 72°C (2 minutes). 
[0329] Primer sequences: 

EcoRI-JUN(s): 

(5 '-CCGGAATTCATGTGCGGTGGTCGGATCGCCCGG-3 ') (SEQ ID 
NO:61); 

JUN-SacII(as): 

(5 '-GTCGCTACCCGCGGCTCCGCAACCAACGTGGTTCATGAC-3 ') (SEQ 
ID NO:62); 
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JUN-HBcAg(s): 

(5 '-GTTGGTTGCGGAGCCGCGGGTAGCGACATTGACCCTTATAAAGAATTTGG-3 ') 
(SEQ ID NO:63); 

HBcAg(l-144)ffind(as): 

(5 '-CGCGTCCCAAGCTTCTACGGAAGCGTTGATAGGATAGG-3 ') (SEQ 
IDNO:64). 

[0330] Fusion of the two PGR fragments was performed by PGR using primers 

EcoRI- JUN(s) and HBcAg(l-144)Hind(as) . 1 00 pmol of each oligo was used with 
1 OOng of the purified PGR fragments in a 50 ]ul reaction mixture containing 2 units 
of Pwo polymerase, 0.1 mM dNTPs and 2 mM MgSQ 4 . PGR cycling conditions 
were: 94°C for 2 minutes; and 35 cycles of 94°C (1 minute), 50°C (1 minute), 
72 °C (2 minutes). The final PCR product was analyzed by agarose gel 
electrophoresis, purified and digested for 16 hours in an appropriate buffer with 
EcoRI and Hindlll restriction enzymes. The digested DNA fragment was ligated 
into EcoRI/Hindlll-digested pKK vector to generate pKK-JUN-HBcAg 
expression vector. Insertion of the PCR product was analyzed by EcoRI/Hindlll 
restriction analysis and by DNA sequencing of the insert. 

EXAMPLE 18 

Fusion of JUN amphipathic helix to the carboxy terminus of HBcAg(l-144) 
[0331] The JUN helix was fused to the carboxy terminus of the HBcAg amino 

acid sequence 1 to 144 (HBcAg- JUN construct). For construction of the HBcAg- 
JUN DNA sequence, the sequences encoding the JUN helix and HBcAg(l-144) 
were amplified separately by PCR. The JUN sequence was amplified from the 
pJuFo plasmid with primers SacII-JUN(s) and JUN-Hindlll(as). SacII-JUN(s) 
introduced a linker encoding amino acids LAAG. This sequence also contains a 
SacII site. JUN-Hindlll(as) introduced a stop codon (TAA) followed by a 
Hindlll site. The HBcAg(l-144) DNA sequence was amplified from the pEco63 
plasmid using primers EcoRI-HBcAg(s) and HBcAg(l-144)-JUN(as). EcoRI- 
HBcAg(s) introduced an EcoRI site prior to the Start ATG of the HBcAg coding 
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sequence. HBcAg(l-144)-JUN(as) introduces a sequence encoding the peptide 
linker (LAAG), which also contains a SacII site. For the PCR reactions, 100 pmol 
of each oligo and 50 ng of the template DNAs were used in the 50 pi reaction 
mixtures with 2 units of Pwo polymerase, 0.1 mM dNTPs and 2 mM MgS0 4 . 
Temperature cycling was carried out as follows: 94°C for 2 minutes; and 30 
cycles of 94°C (1 minute), 50°C (1 minute), 72°C (2 minutes). 
[0332] Primer sequences 

SacII-JUN(s): 

(5 '-CTAGCCGCGGGTTGCGGTGGTCGGATCGCCCGG-3 ') (SEQ ID 
NO:65); 

JUN-Hindlll(as): 

(5'-CGCGTCCCAAGCTTTTAGCAACCAACGTGGTTCATGAC -3') (SEQ 
ID NO:66); 

EcoRI-HBcAg(s): 

(5 '-CCGGAATTC ATGGACATTGACCCTTAT AAAG-3 ') (SEQ ID NO:67); 
and 

HBcAg-JUN(as): 

(5 '-CCGACCACCGCAACCCGCGGCTAGCGGAAGCGTTGATAGGATAGG-3 ') 
(SEQIDNO:68). 

[0333] Fusion of the two PCR fragments was performed by PCR using primers 

EcoRI-HBcAg(s) and JUN-HindHI(as). For the PCR fusion, 100 pmol of each 
oligo was used with lOOng of the purified PCR fragments in a 50 pi reaction 
mixture containing 2 units of Pwo polymerase, 0. 1 mM dNTPs and 2 mM MgS0 4 . 
PCR cycling conditions were: 94°C for 2 minutes; and 35 cycles of 94°C (1 
minute), 50 °C (1 minute), 72 °C (2 minutes). The final PCR product was 
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analyzed by agarose gel electrophoresis, and digested for 16 hours in an 
appropriate buffer with EcoRI and Hindlll restriction enzymes. The DNA 
fragment was gel purified and ligated into EcoRI/Hindlll-digested pKK vector to 
generate pKK-HBcAg- JUN expression vector. Insertion of the PGR product was 
analyzed by EcoRI/Hindlll restriction analysis and by DNA sequencing of the 
insert. 

EXAMPLE 19 

Insertion of JUN amphipathic helix into the c/el epitope of HBcAg(l-144) 
[0334] The c/el epitope (residues 72 to 88) of HBcAg is known to be located in 

the tip region on the surface of the Hepatitis B virus capsid. A part of this region 
(residues 76 to 82) of the protein was genetically replaced by the JUN helix to 
provide an attachment site for antigens (HBcAg- JUNIns construct) . The HBcAg- 
JUNIns DNA sequence was generated by PCRs: The JUN helix sequence and 
two sequences encoding HBcAg fragments (amino acid residues 1 to 75 and 83 
to 144) were amplified separately by PCR. The JUN sequence was amplified from 
the pJuFo plasmid with primers BamHI-JUN(s) and JUN-SacII(as). BamHI- 
JUN(s) introduced a linker sequence encoding the peptide sequence GSGGG that 
also contains a BamHI site. JUN-SacII(as) introduced a sequence encoding the 
peptide linker GAAGS followed by a sequence complementary to the 3 ' end of 
the JUN coding sequence. The HBcAg(l-75) DNA sequence was amplified from 
the pEco63 plasmid using primers EcoRIHBcAg(s) and HBcAg75-JUN(as). 
EcoRIHBcAg(s) introduced an EcoRI site followed by a sequence corresponding 
to the 5' end of the HBcAg sequence. HB c Ag7 5 - JUN(as) introduced a linker 
encoding the peptide GSGGG after amino acid 75 of HBcAg followed by a 
sequence complementary to the 5 ' end of the sequence encoding the JUN helix. 
The HBcAg (83-144) fragment was amplified using primers JUN-HBcAg83(s) 
and HBcAg(l-144)Hind(as). JUN-HBcAg83(s) contained a sequence 
corresponding to the 3 ' end of the JUN-encoding sequence followed by a linker 
encoding the peptide, GAAGS and a sequence corresponding to the 5 ' end of the 
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sequence encoding HBcAg (83-144). HBcAg(l-144)Hind(as) introduced a stop 
codon and a Hindlll site after codon 144 of the HBcAg gene. For the PCR 
reactions, 100 pmol of each oligo and 50 ng of the template DNAs were used in 
the 50 ul reaction mixtures (2 units of Pwo polymerase, 0.1 mM dNTPs and 2 
mM MgS0 4 ). Temperature cycling was performed as follows: 94°C for 2 
minutes; and 35 cycles of 94° C (1 minute), 50° C (1 minute), 72 °C (2 minutes). 

[0335] Primer sequences: 

BamHI-JUN(s): 

(5 '-CTAATGGATCCGGTGGGGGCTGCGGTGGTCGGATCGCCCGGCTCGAG-3 ') 
(SEQ ID NO:69); 

JUN-SacII(as): 

(5 '-GTCGCTACCCGCGGCTCCGCAACCAACGTGGTTC ATGAC-3 ') (SEQ 
ID NO: 70); 

EcoRIHBcAg(s): 

(5'- CCGGAATTCATGGACATTGACCCTTATAAAG-3 ') (SEQ ID NO:71); 
HBcAg75-JUN (as): 

(5 '-CCGACCACCGCAGCCCCCACCGGATCCATTAGTACCCACCCAGGTAGC-3 ') 
(SEQ ID NO:72); 

JUN-HBcAg83(s): 

(5 '-GTTGGTTGCGGAGCCGCGGGTAGCGACCTAGTAGTCAGTTATGTC-3 ') 
(SEQ ID NO.73); and 
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HBcAg(l-144)Hind(as): 

(5 '-CGCGTCCCAAGCTTCTACGGAAGCGTTGATAGGATAGG-3 ') (SEQ 
IDNO:74). 

[0336] Fusion of the three PCR fragments was performed as follows. First, the 

fragment encoding HBcAg 1-75 was fused with the sequence encoding JUN by 
PCR using primers EcoRIHBcAg(s) and JUN-SacII(as). Second, the product 
obtained was fused with the HBcAg(83-144) fragment by PCR using primers 
EcoRI HBcAg(s) and HBcAg Hindlll(as). For PCR fusions, 100 pmol of each 
oligo was used with 100 ng of the purified PCR fragments in a 50 ju.1 reaction 
mixture containing 2 units of Pwo polymerase, 0. 1 mM dNTPs and 2 mMMgS0 4 . 
The same PCR cycles were used as for generation of the individual fragments. 
The final PCR product was digested for 16 hours in an appropriate buffer with 
EcoRI and Hindlll restriction enzymes. The DNA fragment was ligated into 
EcoRI/Hindlll-digested pKK vector, yielding the pKK-HBcAg-JUNIns vector. 
Insertion of the PCR product was analyzed by EcoRI/Hindlll restriction analysis 
and by DNA sequencing of the insert. 

EXAMPLE 20 

Fusion of the JUN amphipathic helix to the carboxy terminus of the ' 
measles virus nucleocapsid (N) protein 
[0337] The JUN helix was fused to the carboxy terminus of the truncated measles 

virus N protein fragment comprising amino acid residues 1 to 473 (N473-JUN 
construct). For construction of the DNA sequence encoding N473-JUN the 
sequence encoding the JUN helix and the sequence encoding N473-JUN were 
amplified separately by PCR. The JUN sequence was amplified from the pJuFo 
plasmid with primers SacII-JUN(s) and JUN-Hindlll(as). SacII-JUN(s) 
introduced a sequence encoding peptide linker LAAG. This sequence also 
contained a SacII site. The JUN-Hindlll(as) anti-sense primer introduced a stop 
codon (TAA) followed by a Hindlll site. The N (1-473) sequence was amplified 
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from the pSC-N plasmid containing the complete measles virus N protein coding 
sequence (obtained from M. Billeter, Zurich) using primers EcoRI-Nmea(s) and 
Nmea-JUN(as). EcoRI-N(mea)(s) introduced an EcoRI site prior to the Start 
ATG of the N coding sequence. N(mea)-JUN(as) was complementary to the 3 ' 
end of the N( 1-473) coding sequence followed by a sequence complementary to 
the coding sequence for the peptide linker (LAAG). For the PGR reactions, 100 
pmol of each oligo and 50 ng of the template DNAs were used in the 50 jul 
reaction mixtures with 2 units of Pwo polymerase, 0.1 mM dNTPs and 2 mM 
MgS0 4 . Temperature cycling was performed as follows: 94°C for 2 minutes; and 
35 cycles of 94°C (1 minute), 55°C (1 minute), 72°C (2 minutes). 

[0338] Primer sequences: 

SacII-JUN(s): 

(5 '-CT AGCCGCGGGTTGCGGTGGTCGGATCGCCCGG-3 ') (SEQ ID 
NO:75); 

JUN-Hmdlll(as): 

(S'-CGCGTCCCAAGCTTTTAGCAACCAACGTGGTTCATGAC -3') (SEQ 
ID NO: 76); 

EcoRI-Nmea(s): 

(5'~CCGGAATTCATGGCCACACTTTTAAGGAGC-3 ') (SEQ IDNO:77); and 
Nmea-JUN(as): 

(5 ' -CGCGTCCC AAGCTTTT AGC AACC AACGTGGTTCATGAC-3 ') (SEQ ID 
NO:78). 

Fusion of the two PCR fragments was performed in a further PCR using 
primers EcoRI-Nmea(s) and Nmea-JUN(as). For the PCR fusion, 100 pmol of 
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each oligo was used with 1 00 ng of the purified PCR fragments in a 50 |il reaction 
mixture containing 2 units of Pwo polymerase, 0. 1 mM dNTPs and 2 mM MgS0 4 . 
Temperature cycling was performed as follows: 94 ° C for 2 minutes; and 3 5 cycles 
of 94°C (1 minute), 50°C (1 minute), 72°C (2 minutes). The PCR product was 
digested for 1 6 hours in an appropriate buffer with EcoRI and Hindlll restriction 
enzymes. The DNA fragment was gel purified and ligated into EcoRI/Hindlll- 
digested pKK vector, yielding the pKK-N473 - JUN plasmid. Insertion of the PCR 
product was analyzed by EcoRI/Hindlll restriction analysis and by DNA 
sequencing of the insert. 

Example 21 

Expression and partial purification of HBcAg-JUN 
[0339] E. coli strain XL-1 blue was transformed with pKK-HBcAg-JUN. 1 ml 

of an overnight culture of bacteria was used to innoculate 100 ml of LB medium 
containing 1 00 |ng/ml ampicillin. This culture was grown for 4 hours at 3 7 ° C until 
an OD at 600 nm of approximately 0.8 was reached. Induction of the synthesis 
of HBcAg-JUN was performed by addition of IPTG to a final concentration of 1 
mM. After induction, bacteria were further shaken at 37°C for 16 hours. 
Bacteria were harvested by centriflxgation at 5000 x g for 15 minutes. The pellet 
was frozen at -20 °C. The pellet was thawed and resuspended in bacteria lysis 
buffer (10 mM Na 2 HP0 4 , pH 7.0, 30 mM NaCl, 0.25% Tween-20 3 1 0 mM 
EDTA, 10 mM DTT) supplemented with 200 |ig/ml lysozyme and 10 nl of 
Benzonase (Merck). Cells were incubated for 30 minutes at room temperature 
and disrupted using a French pressure cell Triton X-100 was added to the lysate 
to a final concentration of 0.2%, and the lysate was incubated for 30 minutes on 
ice and shaken occasionally. Figure 4 shows HBcAg-JUN protein expression in 
E. coli upon induction with IPTG. E. coli cells harboring pKK-HB cAg-JUN 
expression plasmid or a control plasmid were used for induction of HBcAg-JUN 
expression with IPTG. Prior to the addition of IPTG, a sample was removed from 
the bacteria culture carrying the pKK-HB c Ag- JUN plasmid (lane 3) and from a 
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culture carrying the control plasmid (lane 1). Sixteen hours after addition of 
IPTG, samples were again removed from the culture containing pKK-HBcAg- 
JUN (lane 4) and from the control culture (lane 2). Protein expression was 
monitored by SDS-PAGE followed by Coomassie staining. 

[0340] The lysate was then centrifuged for 30 minutes at 12,000 x g in order to 

remove insoluble cell debris. The supernatant and the pellet were analyzed by 
Western blotting using a monoclonal antibody against HBcAg (YVS1841, 
purchased from Accurate Chemical and Scientific Corp., Westbury, NY, USA), 
indicating that a significant amount of HBcAg-JUN protein was soluble (Fig. 5). 
Briefly, lysates from£ coli cells expressing HBcAg-JUN and from control cells 
were centrifuged at 14,000 x g for 30 minutes. Supernatant (= soluble fraction) 
and pellet (= insoluble fraction) were separated and diluted with SDS sample 
buffer to equal volumes. Samples were analyzed by SDS-PAGE followed by 
Western blotting with anti-HBcAg monoclonal antibody YVS 1841. Lane 1: 
soluble fraction, control cells; lane 2: insoluble fraction, control cells; lane 3: 
soluble fraction, cells expressing HBcAg-JUN; lane 4: insoluble fraction, cells 
expressing HbcAg-JUN. 

[0341] The cleared cell lysate was used for step-gradient centrifugation using a 

sucrose step gradient consisting of a 4 ml 65% sucrose solution overlaid with 3 
ml 15% sucrose solution followed by 4 ml of bacterial lysate. The sample was 
. centrifuged for 3 hrs with 100,000 x gat 4° C. After centrifugation, 1ml fractions 
from the top of the gradient were collected and analyzed by SDS-PAGE followed 
by Coomassie staining. (Fig. 6). Lane 1: total E. coli lysate prior to 
centrifugation. Lane 1 and 2: fractions 1 and 2 from the top of the gradient. 
Lane 4 to 7: fractions 5 to 8 (15% sucrose). The HBcAg-JUN protein was 
detected by Coomassie staining. 

[0342] The HBcAg-JUN protein was enriched at the interface between 15 and 

65% sucrose indicating that it had formed a capsid particle. Most of the bacterial 
proteins remained in the sucrose-free upper layer of the gradient, therefore step- 
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gradient centrifiigation of the HBcAg-JUN particles led both to enrichment and 
to a partial purification of the particles. 

EXAMPLE 22 
Covalent Coupling of hGH-FOS to HBcAg-JUN 
[0343] In order to demonstrate binding of a protein to HBcAg-JUN particles, we 

chose human growth hormone (hGH) fused with its carboxy terminus to the FOS 
helix as a model protein (hGH-FOS). HBcAg-JUN particles were mixed with 
partially purified hGH-FOS and incubated for 4 hours at 4° C to allow binding of 
the proteins. The mixture was then dialyzed overnight against a 3000-fold volume 
of dialysis buffer (150 mM NaCl, 10 mM Tris-HCl solution, pH 8.0) in order to 
remove DTT present in both the HB c Ag- JUN solution and the hGH-FO S solution 
and thereby allow covalent coupling of the proteins through the establishment of 
disulfide bonds. As controls, the HBcAg-JUN and the hGH-FOS solutions were 
also dialyzed against dialysis buffer. Samples from all three dialyzed protein 
solutions were analyzed by SDS-P AGE under non-reducing conditions. Coupling 
of hGH-FOS to HBcAg-JUN was detected in an anti-hGH immunoblot (Fig. 7). 
hGH-FOS bound to HBcAg-JUN should migrate with an apparent molecular mass 
of approximately 53 kDa, while unbound hGH-FOS migrates with an apparent 
molecular mass of 31 kDa. The dialysate was analyzed by SDS-P AGE in the 
absence of reducing agent (lane 3) and in the presence of reducing agent (lane 2) 
and detected by Coomassie staining. As a control, hGH-FOS that had not been 
mixed with capsid particles was also loaded on the gel in the presence of reducing 
agent (lane 1). 

[0344] A shift of hGH-FOS to a molecular mass of approximately 53 kDa was 

observed in the presence of HBcAg-JUN capsid protein, suggesting that efficient 
binding of hGH-FOS to HBcAg-JUN had taken place. 
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EXAMPLE 23 
Insertion of a peptide containing a Lysine residue into the 
c/el epitope of HBcAg(l-149) 

[0345] The c/el epitope (residues 72 to 88) of HBcAg is located in the tip region 

on the surface of the Hepatitis B virus capsid (HBcAg). A part of this region 
(Proline 79 and Alanine 80) was genetically replaced by the peptide Gly-Gly-Lys- 
Gly-Gly (HBcAg-Lys construct). The introduced Lysine residue contains a 
reactive amino group in its side chain that can be used for intermolecular chemical 
crosslinking of HBcAg particles with any antigen containing a free cysteine group. 

[0346] HBcAg-Lys DNA, having the amino acid sequence shown in SEQ ID 

NO:158, was generated by PCRs: The two fragments encoding HBcAg fragments 
(amino acid residues 1 to 78 and 81 to 149) were amplified separately by PGR. 
The primers used for these PCRs also introduced a DNA sequence encoding the 
Gly-Gly-Lys-Gly-Gly peptide. The HBcAg (1 to 78) fragment was amplified from 
pEco63 using primers EcoRIHBcAg(s) and Lys-HBcAg(as). The HBcAg (8 1 to 
149) fragment was amplified from pEco63 using primers Lys-HBcAg(s) and 
HB c Ag( 1 - 1 49)Hind(as) . Primers Lys-HBcAg(as) and Lys-HBcAg(s) introduced 
complementary DNA sequences at the ends of the two PCR products allowing 
fusion of the two PCR products in a subsequent assembly PCR. The assembled 
fragments were amplified by PCR using primers EcoRIHBcAg(s) and HbcAg(l- 
149)Hind(as). 

[0347] For the PCRs, 100 pmol of each oligo and 50 ng of the template DNAs 

were used in the 50 jliI reaction mixtures with 2 units of Pwo polymerase, 0. 1 mM 
dNTPs and 2 mM MgS04. For both reactions 5 temperature cycling was carried 
out as follows: 94°C for 2 minutes; 30 cycles of 94°C (1 minute), 50°C (1 
minute), 72 °C (2 minutes). 



WO 01/85208 



PCT/IB01/00741 



-106- 

[0348] Primer sequences: 

EcoRIHBcAg(s): 

(5 '-CCGGAATTCATGGACATTGACCCTTATAAAG-3 ') (SEQ ID NO:79); 
Lys-HB c Ag(as) : 

(5'-CCTAGAGCCACCTTTGCCACCATCTTCTAAATTAGTACCCACCCAG 
GTAGC-3') (SEQ ID NO: 80); 

Lys-HB cAg(s): 

(5 ' -GAAGATGGTGGC AAAGGTGGCTCTAGGGACCTAGTAGTC AGTTAT 
GTC -3') (SEQ ID NO:81); 

HB c Ag( 1 - 1 49)Hind(as) : 

(5 '-CGCGTCCCAAGCTTCTAAACAACAGTAGTCTCCGGAAG-3 ') (SEQ ID 
NO:82). 

[0349] For fusion of the two PCR fragments by PCR 100 pmol of primers 

EcoRIHBcAg(s) and HBcAg(l-149)Hind(as) were used with 100 ng of the two 
purified PCR fragments in a 50 ul reaction mixture containing 2 units of Pwo 
polymerase, 0.1 mM dNTPs and 2 mM MgS0 4 . PCR cycling conditions were: 
94°C for 2 minutes; 30 cycles of 94°C (1 minute), 50°C (1 minute), 72°C (2 
minutes). The assembled PCR product was analyzed by agarose gel 
electrophoresis, purified and digested for 19 hours in an appropriate buffer with 
EcoRI and Hindlll restriction enzymes. The digested DNA fragment was ligated 
into EcoRl/Hindlll-digested pICK vector to generate pKK-HBcAg-Lys expression 
vector. Insertion of the PCR product into the vector was analyzed by 
EcoRI/Hindin restriction analysis and DNA sequencing of the insert. 

[0350] The amino acid sequence of the HBcAg-Lys polypeptide is 

MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREAIESPEHCSP 
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HHTALRQAILCWGELMTLATWVGTNLEDGGKGGSRDLVVSYVNTNM 
GLKIRQLLWFffiS CLTC S TL 

PETTVV (SEQ ID NO: 185). This sequence differs from SEQ ID NO: 134 at 
amino acid 74 (N in SEQ ID NO: 13 14, T in SEQ ID NO: 185) and at amino acid 
87 (N in SEQ ID NO: 134, S in SEQ ID NO: 185). 

EXAMPLE 24 
Expression and partial purification of EDBcAg-Lys 
[0351] E. coli strain XL-1 blue was transformed with pKK-HBcAg-Lys. 1 ml of 

an overnight culture of bacteria was used to innoculate 100 ml of LB medium 
containing 1 00 |ig/ml ampicillin. This culture was grown for 4 hours at 3 7 ° C until 
an OD at 600 nm of approximately 0.8 was reached. Induction of the synthesis 
of HBcAg-Lys was performed by addition of IPTG to a final concentration of 1 
mM. After induction, bacteria were further shaken at 37°C for 16 hours. 
Bacteria were harvested by centrifugation at 5000 x g for 15 minutes. The pellet 
was frozen at -20 °C. The pellet was thawed and resuspended in bacteria lysis 
buffer (10 mM Na 2 HP0 4? pH 7.0, 30 mM NaCl, 0.25% Tween-20, 10 mM 
EDTA, 10 mM DTT) supplemented with 200 fig/ml lysozyme and 10 \A of 
Benzonase (Merck). Cells were incubated for 30 minutes at room temperature 
and disrupted using a French pressure cell. Triton X-100 was added to the lysate 
to a final concentration of 0.2%, and the lysate was incubated for 30 minutes on 
ice and shaken occasionally. E. coli cells harboring pKK-HBcAg-Lys expression 
plasmid or a control plasmid were used for induction of HBcAg-Lys expression 
with IPTG. Prior to the addition of IPTG, a sample was removed from the 
bacteria culture carrying the pKK-HB c Ag-Ly s plasmid and from a culture carrying 
the control plasmid. Sixteen hours after addition of IPTG, samples were again 
removed from the culture containing pKK-HB c Ag-Ly s and from the control 
culture. Protein expression was monitored by SDS-P AGE followed by Coomassie 
staining. 
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[0352] The lysate was then centrifuged for 30 minutes at 12,000 x g in order to 

remove insoluble cell debris. The supernatant and the pellet were analyzed by 
Western blotting using a monoclonal antibody against HBcAg (YVS1841, 
purchased from Accurate Chemical and Scientific Corp., Westbury, NY, USA), 
indicating that a significant amount of HBcAg-Lys protein was soluble. Briefly, 
lysates from E. coli cells expressing HBcAg-Lys and from control cells were 
centrifuged at 14,000 x g for 30 minutes. Supernatant (= soluble fraction) and 
pellet (= insoluble fraction) were separated and diluted with SDS sample buffer 
to equal volumes. Samples were analyzed by SDS -PAGE followed by Western 
blotting with anti-HBcAg monoclonal antibody YVS 1841. 

[0353] The cleared cell lysate was used for step-gradient centrifugation using a 

sucrose step gradient consisting of a 4 ml 65% sucrose solution overlaid with 3 
ml 15% sucrose solution followed by 4 ml of bacterial lysate. The sample was 
centrifuged for 3 hrs with 100,000 x g at 4 ° C. After centrifugation, 1 ml fractions 
from the top of the gradient were collected and analyzed by SDS-PAGE followed 
by Coomassie staining. The HBcAg-Lys protein was detected by Coomassie 
staining. 

[0354] The HBcAg-Lys protein was enriched at the interface between 15 and 

65% sucrose indicating that it had formed a capsid particle. Most of the bacterial 
proteins remained in the sucrose-free upper layer of the gradient, therefore step- 
gradient centrifugation of the HBcAg-Lys particles led both to enrichment and to 
a partial purification of the particles. 

EXAMPLE 25 
Chemical coupling of FLAG peptide to HBcAg-Lys 
using the heterobifiinctional cross-linker SPDP 
[0355] Synthetic FLAG peptide with a Cysteine residue at its amino terminus 

(amino acid sequence CGGDYKDDDDK (SEQ ID NO: 147)) was coupled 
chemically to purified HBcAg-Lys particles in order to elicit an immune response 
against the FLAG peptide. 600 \i\ of a 95% pure solution of HBcAg-Lys particles 
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(2 mg/ml) were incubated for 30 minutes at room temperature with the 
heterobifunctional cross-linker N-Succinimidyl 3-(2-pyridyldithio) propionate 
(SPDP) (0.5 mM). After completion of the reaction, the mixture was dialyzed 
overnight against 1 liter of 50 mM Phosphate buffer (pH 7.2) with 150 mM NaCl 
to remove free SPDP. Then 500 jal of derivatized HBcAg-Lys capsid (2 mg/ml) 
were mixed with 0. 1 mM FLAG peptide (containing an amino-terminal cysteine) 
in the presence of 1 0 mM EDTA to prevent metal-catalyzed sulfhydryl oxidation. 
The reaction was monitored through the increase of the optical density of the 
solution at 343 nm due to the release of pyridine-2-thione from SPDP upon 
reaction with the free cysteine of the peptide. The reaction of derivatized Lys 
residues with the peptide was complete after approximately 30 minutes. 
[0356] The FLAG decorated particles were injected into mice. 

EXAMPLE 26 
Construction of pMP S V-gp 1 4 0 cy s 

[0357] The gpl40 gene was amplified by PGR from pCytTSgpl40FOS using 

oligos gp 140CysEcoRI and Sallgp 140. For the PCRs, 1 00 pmol of each oligo and 
50 ng of the template DNAs were used in the 50 p.1 reaction mixtures with 2 units 
of Pwo polymerase, 0.1 mM dNTPs and 2 mM MgS04. For both reactions , 
temperature cycling was carried out as follows: 94°C for 2 minutes; 30 cycles of 
94°C (0.5 minutes), 55°C (0.5 minutes), 72°C (2 minutes). 

[0358] The PGR product was purified using QiaEXII kit, digested with 

Sall/EcoRI and ligated into vector pMPSVHE cleaved with the same enzymes. 

[0359] Oligo sequences: 



Gpl40CysEcoRI: 

S'-GCCGAATTCCTAGCAGCTAGCACCGAATTTATCTAA-S' (SEQ ID 
NO: 83); 
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Sallgpl40: 

5'- GGTT AAGTCGAC ATGAGAGTGAAGGAGAAATAT-3 5 (SEQIDNO:84). 

EXAMPLE 27 
Expression of pMPSVgpl40Cys 
[0360] pMPSVgpl40Cys (20 jig) was linearized by restriction digestion. The 

reaction was stopped by phenol/chloroform extraction, followed by an isopropanol 
precipitation of the linearized DNA. The restriction digestion was evaluated by 
agarose gel eletrophoresis. For the transfection, 5.4 jxg of linearized 
pMPSVgpl40-Cys was mixed with 0.6 \xg of linearized pSV2Neo in 30 H 2 0 
and 30 jliI of 1 M CaCl 2 solution was added. After addition of 60 \il phosphate 
buffer (50 mMHEPES, 280 mMNaCl 5 1.5 mM Na 2 HP0 4 , pH 7.05), the solution 
was vortexed for 5 seconds, followed by an incubation at room temperature for 
25 seconds. The solution was immediately added to 2 ml HP-1 medium 
containing 2% FCS (2% FCS medium). The medium of an 80% confluent 
BHK21 cell culture (6-well plate) was then replaced by the DNA containing 
medium. After an incubation for 5 hours at 37° C in a C0 2 incubator, the DNA 
containing medium was removed and replaced by 2 ml of 1 5% glycerol in 2% FCS 
medium. The glycerol containing medium was removed after a 30 second 
incubation phase, and the cells were washed by rinsing with 5 ml of HDP- 1 medium 
containing 10% FCS. Finally 2 ml of fresh HP-1 medium containing 10% FCS 
was added. 

[0361] Stably transfected cells were selected and grown in selection medium 

(HP-1 medium supplemented with G418) at 37°C in a C0 2 incubator. When the 
mixed population was grown to confluency, the culture was split to two dishes, 
followed by a 12 h growth period at 37 °C. One dish of the cells was shifted to 
30 °C to induce the expression of soluble GP140-FOS. The other dish was kept 
at 37°C. 

[0362] The expression of soluble GPl40-Cys was determined by Western blot 

analysis. Culture media (0.5 ml) was methanol/chloroform precipitated, and the 
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pellet was resuspended in SDS-PAGE sample buffer. Samples were heated for 5 
minutes at 95 0 C before being applied to a 1 5% acrylamide gel. After SDS-PAGE, 
proteins were transferred to Protan nitrocellulose membranes (Schleicher & 
Schuell, Germany) as described by Bass and Yang, in Creighton, T.E., ed., 
Protein Function: A Practical Approach, 2nd Edn., IRL Press, Oxford (1997), 
pp. 29-55. The membrane was blocked with 1 % bovine albumin (Sigma) in TBS 
(lOxTBS per liter: 87.7 g NaCl, 66. Ig Trizma hydrochloride (Sigma) and 9.7 g 
Trizma base (Sigma), pH 7.4) for 1 hour at room temperature, followed by an 
incubation with an anti-GP140 or GP-160 antibody for 1 hour. The blot was 
washed 3 times for 10 minutes with TBS-T (TBS with 0.05% Tween20), and 
incubated for 1 hour with an alkaline-phosphatase-anti- 
mouse/rabbit/monkey/human IgG conjugate. After washing 2 times for 10 
minutes with TBS-T and 2 times for 10 minutes with TBS, the development 
reaction was carried out using alkaline phosphatase detection reagents (10 ml AP 
buffer (100 mMTris/HCl, 100 mM NaCl, pH 9.5) with 50 jilNBT solution (7.7% 
Nitro Blue Tetrazolium (Sigma) in 70% dimethylformamide) and 37 jllI of 
X-Phosphate solution (5% of 5-bromo-4-chloro-3-indolyl phosphate in 
dimethylformamide) . 

EXAMPLE 28 
Purification of gpl40Cys 

[0363] An anti-gpl20 antibody was covalently coupled to aNHS/EDC activated 

dextran and packed into a chromatography column. The supernatant, containing 
GP 1 AOCys is loaded onto the column and after sufficient washing, GP 1 AOCys was 
eluted using 0.1 M HC1. The eluate was directly neutralized during collection 
using 1 M Tris pH 7.2 in the collection tubes. 

[0364] Disulfide bond formation might occur during purification, therefore the 

collected sample is treated with 10 mM DTT in 10 mM Tris pH 7.5 for 2 hours 
at25°C. 
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[0365] DTT is remove by subsequent dialysis against lOmMMes; 80 mMNaCl 

pH 6.0. Finally GP140Cys is mixed with alphavirus particles containing the JUN 
residue in E2 as described in Example 16. 

EXAMPLE 29 
Construction of PLA2-Cys 

[0366] The PLA2 gene was amplified by PGR from pAV3PLAfos using oligos 

EcoRIPLA and PLA-Cys-hind. ForthePCRs, 100 pmol of each oligo and 50 ng 
of the template DNAs were used in the 50 p.1 reaction mixtures with 2 units of 
Pwo polymerase, 0.1 mM dNTPs and 2 mM MgS04. For both reactions , 
temperature cycling was carried out as follows: 94 °C for 2 minutes; 30 cycles of 
94°C (0.5 minutes), 55°C (0.5 minutes), 72°C (2 minutes). 

[0367] The PGR product was purified using QiaEXII kit, digested with 

EcoRI/HinDIII and ligated into vector pAV3 cleaved with the same enzymes. 

[0368] Oligos 
EcoRIPLA: 

5 ' -T AACCGAATTC AGGAGGT AAAAAGAT ATGG-3 ' (SEQ ID NO:85) 
PLACys-hind: 

5 ' -GAAGT AAAGCTTTTAACC ACCGC AACC ACC AGAAG-3 5 (SEQ ID 
NO:86). 

EXAMPLE 30 
Expression and Purification of PLA-Cys 
[0369] For cytoplasmic production of Cys tagged proteins, E. coli XL- 1 -Blue 

strain was transformed with the vectors p AV3 : :PLA and pPLA-Cys. The culture 
was incubated in rich medium in the presence of ampicillin at 37 °C with shaking. 
At an optical density (550nm) of, 1 mM IPTG was added and incubation was 
continued for another 5 hours. The cells were harvested by centrifiigation, 
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resuspended in an appropriate buffer (e.g., Tris-HCl, pH 7.2, 150 raM NaCl) 
containing DNase, RNase and lysozyme, and disrupted by passage-through a 
french pressure cell. After centrifugation (Sorvall RC-5C, SS34 rotor, 
15000 rpm, 10 min, 4°C), the pellet was resuspended in 25 ml inclusion body 
wash buffer (20 mM tris-HCl, 23% sucrose, 0.5% Triton X-100, 1 mM EDTA, 
pH8) at 4° C and recentrifuged as described above. This procedure was repeated 
until the supernatant after centrifugation was essentially clear. Inclusion bodies 
were resuspended in 20 ml solubilization buffer (5.5 M guanidinium 
hydrochloride, 25 mM tris-HCl, pH 7.5) at room temperature and insoluble 
material was removed by centrifugation and subsequent passage of the supernatant 
through a sterile filter (0.45 jim). The protein solution was kept at 4°C for at 
least 10 hours in the presence of 10 mM EDTA and 100 mM DTT and then 
dialyzed three times against 10 volumes of 5.5 M guanidinium hydrochloride, 25 
mM tris-HCl, 10 mM EDTA, pH 6. The solution was dialyzed twice against 5 1 
2 M urea, 4 mM EDTA, 0.1 M NH 4 C1, 20 mM sodium borate (pH 8.3) in the 
presence of an appropriate redox shuffle (oxidized glutathione/reduced 
glutathione; cystine/cysteine). The refolded protein was then applied to an ion 
exchange chromatography. The protein was stored in an appropriate buffer with 
a pH above 7 in the presence of 2-10 mM DTT to keep the cysteine residues in 
a reduced form. Prior to coupling of the protein with the alphavirus particles, 
DTT was removed by passage of the protein solution through a Sephadex G-25 
gel filtration column. 



EXAMPLE 31 

Construction of a HBcAg devoid of free cysteine residues and containing 

an inserted lysine residue 
[0370] AHepatitis core Antigen (HBcAg), referred to herein as HBcAg-lys-2cys- 

Mut, devoid of cysteine residues at positions corresponding to 48 and 107 in SEQ 
ID NO: 134 and containing an inserted lysine residue was constructed using the 
following methods. 
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[0371] The two mutations were introduced by first separately amplifying three 

fragments of the HBcAg-Lys gene prepared as described above in Example 23 
with the following PCR primer combinations. PCR methods essentially as 
described in Example 1 and conventional cloning techniques were used to prepare 

the HBcAg-lys-2cys-Mut gene. , 

h 

[0372] In brief, the following primers were used to prepare fragment 1 : 

Primer 1: EcoRIHBcAg(s) 

CCGGAATTCATGGACATTGACCCTTATAAAG (SEQ ID NO: 148) 
Primer 2: 48 as 

GTGCAGTATGGTGAGGTGAGGAATGCTCAGGAGACTC (SEQ ID 
NO: 149) 

[0373] The following primers were used to prepare fragment 2: 

Primer 3 : 48s 

GSGTCTCCTGAGCATTCCTCACCTCACCATACTGCAC(SEQIDNO:150) 
Primer 4: 107as 

CTTCCAAAAGTGAGGGAAGAAATGTGAAACCAC (SEQ ID NO: 151) 

[0374] The following primers were used to prepare fragment 3 : 

Primer 5: HBcAgl49hind-as 

CGCGTCCCAAGCTTCTAAACAACAGTAGTCTCCGGAAGCGTTGATAG 
(SEQ ID NO: 152) 

Primer 6: 107s 

GTGGTTTCACATTTCTTCCCTCACTTTTGGAAG (SEQ ID NO: 153) 

[0375] Fragments 1 and 2 were then combined with PCR primers 

EcoRIHBcAg(s) and 107as to give fragment 4. Fragment 4 and fragment 3 were 
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then combined with primers EcoRIHBcAg(s) and HBcAgl49hind-as to produce 
the full length gene. The full length gene was then digested with the EcoRI 
(GAATTC) and Hindlll (AAGCTT) enzymes and cloned into the pKK vector 
(Pharmacia) cut at the same restriction sites. The amino acid sequence of the 
HBcAg-Lys-2cys-Mut polypeptide is MDIDPYKEFGATVELLSFL 
PSDFFPSVRDLLDTASALYREALESPEHSSPHHTALRQAILCWGELMTL 
ATWVGTNLEDGGKGGSRDLVVSYVNTNMGLKIRQLLWFHISSLTFGR 
ETVLEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVV (SEQ ID NO: 186). 

EXAMPLE 32 

Blockage of free cysteine residues of a HBcAg followed by cross-linking 
[0376] The free cysteine residues of the HBcAg-Lys prepared as described above 

in Example 23 were blocked using Iodacetamide. The blocked HBcAg-Lys was 
then cross-linked to the FLAG peptide with the hetero-bifunctional cross-linker 
m-maleimidonbenzoyl-N-hydroxysuccinimide ester (Sulfo-MBS). 
[0377] The methods used to block the free cysteine residues and cross-link the 

HBcAg-Lys are as follows. HBcAg-Lys (550 |ug/ml) was reacted for 15 minutes 
at room temperature with Iodacetamide (Fluka Chemie, Brugg, Switzerland) at 
a concentration of 50 mM in phosphate buffered saline (PBS) (50 mM sodium 
phosphate, 150 mM sodium chloride), pH 7.2, in a total volume of 1 ml. The so 
modified HBcAg-Lys was then reacted immediately with Sulfo-MBS (Pierce) at 
a concentration of 530 |j.M directly in the reaction mixture of step 1 for 1 hour at 
room temperature. The reaction mixture was then cooled on ice, and dialyzed 
against 1000 volumes of PBS pH 7.2. The dialyzed reaction mixture was finally 
reacted with 300 |iM of the FLAG peptide (CGGDYKDDDDK (SEQ ID 
NO: 147)) containing an N-terminal free cysteine for coupling to the activated 
HBcAg-Lys, and loaded on SDS-PAGE for analysis. 
[0378] As shown in Figure 8, the resulting patterns of bands on the SDS-PAGE 

gel showed a clear additional band migrating slower than the control HBcAg-Lys 
derivatized with the cross-linker, but not reacted with the FLAG peptide. 



WO 01/85208 



PCT/IB01/00741 



-116- 

Reactions done under the same conditions without prior derivatization of the 
cysteines with Iodacetamide led to complete cross-linking of monomers of the 
HBcAg-Lys to higher molecular weight species. 

EXAMPLE 33 

Isolation of Type- 1 pili and chemical coupling of FLAG peptide to Type-1 pili 

of Escherichia coli using a heterobifiinctional cross-linker 
A. Introduction 

[0379] Bacterial pili or fimbriae are filamentous surface organelles produced by 

a wide range of bacteria. These organelles mediate the attachment of bacteria to 
surface receptors of host cells and are required for the establishment of many 
bacterial infections like cystitis, pyelonephritis, new born meningitis and diarrhea. 

[0380] Pili can be divided in different classes with respect to their receptor 

specificity (agglutination of blood cells from different species), their assembly 
pathway (extracellular nucleation, general secretion, chaperone/usher, alternate 
chaperone) and their morphological properties (thick, rigid pili; thin, flexible pili; 
atypical structures including capsule; curli; etc). Examples of thick, rigid pili 
forming a right handed helix that are assembled via the so called chaperone/usher 
pathway and mediate adhesion to host glycoproteins include Type- 1 pili, P-pili, 
S-pili, FIC-pili, and 987P-pili). The most prominent and best characterized 
members of this class of pili are P-pili and Type-1 pili (for reviews on adhesive 
structures, their assembly and the associated diseases see Soto, G. E. & Hultgren, 
S. J., J. BacterioL 757:1059-1071 (1999); Bullitt & Makowski, Biophys. J. 
74:623-632 (1998); Hung, D. L. & Hultgren, S. 1, J. Struct, Biol. 724:201-220 
(1998)). 

[0381] Type-1 pili are long, filamentous polymeric protein structures on the 

surface of E. coli. They possess adhesive properties that allow for binding to 
mannose-containing receptors present on the surface of certain host tissues. 
Type-1 pili can be expressed by 70-80% of all E. coli isolates and a single 7?. coli 
cell can bear up to 500 pili. Type- pili reach a length of typically 0.2 to 2 \iM with 
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an average number of 1000 protein subunits that associate to a right-handed helix 
with 3.125 subunits per turn with a diameter of 6 to 7 nm and a central hole of 2 . 0 
to 2.5 nm. 

[0382] The main Type-1 pilus component, FimA, which represents 98% of the 

total pilus protein, is a 15.8 kDa protein. The minor pilus components FimF, 
FimG and FimH are incorporated at the tip and in regular distances along the pilus 
shaft (Klemm, P. & Krogfelt, K. A, "Type I fimbriae of Escherichia coli" in: 
Fimbriae. Klemm, P. (ed.), CRC Press Inc., (1994) pp. 9-26). FimH, a 29. 1 kDa 
protein, was shown to be the mannose-binding adhesin of Type-1 pili (Krogfelt, 
K. A., et al, Infect. Immun. 55:1995-1998 (1990); Klemm, P., et al, Mol 
Microbiol 4:553-560 (1990); Hanson, M. S. & Brinton, C. C. J. ? Nature 17:265- 
268 (1988)), and its incorporation is probably facilitated by FimG and FimF 
(Klemm, P. & Christiansen, G,M?/. Gen. Genetics 205:439-445 (1987); Russell, 
P. W. & Orndorff, P. E., J. Bacteriol. 774:5923-5935 (1992)). Recently, it was 
shown that FimH might also form a thin tip-fibrillum at the end of the pili (Jones, 
C. H., et al, Proc. Nat. Acad. Set USA 92:2081-2085 (1995)). The order of 
major and minor components in the individual mature pili is very similar, indicating 
a highly ordered assembly process (Soto, G. E. & Hultgren, S. J., J. Bacteriol 
757:1059-1071 (1999)). 

[0383] P-pili of E. coli are of very similar architecture, have a diameter of 6. 8 nm, 

an axial hole of 1 .5 nm and 3.28 subunits per turn (Bullitt & Makowski, Biophys. 
J. 74:623-632 (1998)). The 16.6 kDa PapA is the main component of this pilus 
type and shows 36% sequence identity and 59% similarity to FimA (see Table 1). 
As in Type-1 pili the 36.0 kDa P-pilus adhesin PapG and specialized adapter 
proteins make up only a tiny fraction of total pilus protein. The most obvious 
difference to Type-1 pili is the absence of the adhesin as an integral part of the 
pilus rod, and its exclusive localization in the tip fibrillium that is connected to the 
pilus rod via specialized adapter proteins that Type-1 pili lack (Hultgren, S. J., et 
al, Cell 73:887-901 (1993)). 
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[0384] Table 1 : Similarity and identity between several structural pilus 

proteins of Type- 1 and P-pili (in percent). The adhesins 
were omitted. 



Similarity 





FimA 


PapA 


FimI 


FimF 


FimG 


PapE 


PapK 


PapH 


Pap] 


FiraA 




59 


57 


56 


44 


50 


44 


46 


46 


PapA 


36 




49 


48 


41 


45 


49 


49 


47 


FimI 


35 


31 




56 


46 


40 


47 


48 


48 


FimF 


34 


26 


30 




40 


47 


43 


49 


48 


FimG 


28 


28 


28 


26 




39 


39 


41 


45 


PapE 


25 


23 


18 


28 


22 




43 


47 


54 


PapK 


24 


29 


25 


28 


22 


18 




49 


53 


PapH 


22 


2 6/ . 


:• 22 


22 


23 


24 


23 




41 


PapF 


18 


22 


22 


24 


28 


27 


26 


21 





[0385] Type- 1 pili are extraordinary stable hetero-oligomeric complexes. Neither 

SDS-treatment nor protease digestions, boiling or addition of denaturing agents 
can dissociate Type-1 pili into their individual protein components. The 
combination of different methods like incubation at 100°C at pH 1.8 was initially 
found to allow for the depolymerization and separation of the components 
(Eshdat, Y., etaL, J. BacterioL 745:308-314 (1981); Brinton, C.C. I, Trans, N. 
Y. Acad Set 27: 1003-1054 (1965); Hanson, A. S., etaL, J. BacterioL, 770:3350- 
3358 (1988); Klemm, P. &Krogfelt, K. A., "Type I fimbriae of Escherichia coli," 
in: Fimbriae. Klemm, P. (ed.)> CRC Press Inc., (1994) pp. 9-26). Interestingly, 
Type-1 pili show a tendency to break at positions where FimH is incorporated 
upon mechanical agitation, resulting in fragments that present a FimH adhesin at 
their tips. This was interpreted as a mechanism of the bacterium to shorten pili to 
an effective length under mechanical stress (Klemm, P. & Krogfelt, K. A., "Type 
I fimbriae of Escherichia coli," in: Fimbriae, Klemm, P. (ed.), CRC Press Inc., 
(1994) pp. 9-26). Despite their extraordinary stability, Type-1 pili have been 
shown to unravel partially in the presence of 50% glycerol; they lose their helical 
structure knd form an extended and flexible, 2 nm wide protein chain (Abraham, 
S. N., etaL, J. BacterioL 774:5145-5148 (1992)). 
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[0386] P-pili and Type-1 pili are encoded by single gene clusters on theE. coli 

chromosome of approximately 10 kb (Klemm, P. & Krogfelt, K. A., "Type I 
fimbriae of Escherichia coli" in: Fimbriae. Klemm, P. (ed.), CRC Press Inc., 
(1994) pp. 9-26; Orndorff, P. E. & Falkow, S., J- Bacteriol 760:61-66 (1984)). 
A total of nine genes are found in the Type-1 pilus gene cluster, and 1 1 genes in 
the P-pilus cluster (Hultgren, S. X, etal, Adv. Prot Chem. 44:99-123 (1993)). 
Both clusters are organized quite similarly. 

[0387] The first two Jim-genes, fimB and fimE, code for recombinases involved 

in the regulation of pilus expression (McClain, M. S., et ah, J. Bacteriol 
773:5308-53 14 (1991)). The main structural pilus protein is encoded by the next 
gene of the cluster,//^ (Klemm, P., Euro. J. Biochem. J 43:395-400 (1984); 
Orndorff, P. E. & Falkow, S., J. Bacteriol 160:61-66 (1984); Orndorff, P. E. & 
Falkow, S., J. Bacteriol 7(52:454-457 (1985)). The exact role of fiml is unclear. 
It has been reported to be incorporated in the pilus as well (Klemm, P. & Krogfelt, 
K. A, "Type I fimbriae of Escherichia coli" in: Fimbriae. Klemm, P. (ed.), CRC 
Press Inc., (1994) pp. 9-26). The adjacent fimC codes not for a structural 
component of the mature pilus, but for a so-called pilus chaperone that is essential 
for the pilus assembly (Klemm, P., Res. Microbiol 745:831-838 (1992); Jones, 
C. H., etal, Proc. Nat. AcadSci. USA 90:8397-8401 (1993)). 

[0388] The assembly platform in the outer bacterial membrane to which the 

mature pilus is anchored is encoded by fimD (Klemm, P. & Christiansen, G., Mb/. 
Gen, Genetics 220:334-338 (1990)). The three minor components of the Type-1 
pili, FimF, FimG and FimH are encoded by the last three genes of the cluster 
(Klemm, P. & Christiansen, G.,Mol Gen. Genetics 205:439-445 (1987)). Apart 
from fimB and fimE, all genes encode precursor proteins for secretion into the 
periplasm via the sec-pathway. 

[0389] The similarities between different pili following the chaperone/usher 

pathway are not restricted to their morphological properties. Their genes are also 
arranged in a very similar manner. Generally the gene for the main structural 
subunit is found directly downstream of the regulatory elements at the beginning 
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of the gene cluster, followed by a gene for an additional structural subunit (find 
in the case of Type- 1 pili and papH in the case of P-pili). PapH was shown and 
Fiml is supposed to terminate pilus assembly (Hultgren, S. J. , et al, Cell 73:887- 
901 (1993)). The two proteins that guide the process of pilus formation, namely 
the specialized pilus chaperone and the outer membrane assembly platform, are 
located adjacently downstream. At the end of the clusters a variable number of 
minor pilus components including the adhesins are encoded. The similarities in 
morphological structure, sequence (see Table 1), genetic organization and 
regulation indicate a close evolutionary relationship and a similar assembly process 
for these cell organelles. 

[0390] Bacteria producing Type-1 pili show a so-called phase-variation. Either 

the bacteria are fully piliated or bald. This is achieved by an inversion of a 3 14 bp 
genomic DNA fragment containing the fimA promoter, thereby inducing an "all 
on" or "all oflP' expression of the pilus genes (McClain, M. S., et al, J. Bacterial. 
773:5308-5314 (1991)). The coupling of the expression of the other structural 
pilus genes to fimA expression is achieved by a still unknown mechanism. 
However, a wide range of studies elucidated the mechanism that influences the 
switching between the two phenotypes. ' 

[0391] The first two genes of the Type-1 pilus cluster, fimB and fimE encode 

recombinases that recognize 9 bp DNA segments of dyad symmetry that flank the 
invertable fimA promoter. Whereas FimB switches pilation "on", FimE turns the 
promoter in the "off 5 orientation. The up- or down-regulation of either fimB or 
fimE expression therefore controls the position of the so-called '///^-switch" 
(McClain, M. S., etal, J. Bacteriol 773:5308-5314 (1991); Blomfield, I. C, et 
al, J* Bacteriol 773:5298-5307 (1991)). 

[0392] The two regulatory proteins fimB and/zm£ are transcribed from distinct 

promoters and their transcription was shown to be influenced by a wide range of 
different factors including the integration host factor (IHF) (Blomfield, I. C, et 
ah, Mol Microbiol 23:705-717 (1997)) and the leucine-responsive regulatory 
protein (LRP) (Blomfield, I. C, etal, J. Bacteriol 775:27-36 (1993); Gaily, D. 
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L., et al, J. Bacteriol 775:6186-6193 (1993); Gaily, D. L., et al, Microbiol 
27:725-738 (1996); Roesch, R. L. &Blomfield, I. C.,Mo/. Microbiol 27:751-761 
(1998)). Mutations in the former lock the bacteria either in "on" or "off 5 phase, 
whereas LRP mutants switch with a reduced frequency. In addition, an effect of 
leuX on pilus biogenesis has been shown. This gene is located in the vicinity of 
the ///77-genes on the chromosome and codes for the minor leucine tRNA species 
for the UUG codon. Whereas fimB contains five UUG codons,fimE contains 
only two, and enhanced leuX transcription might favor FimB over FimE 
expression (Burghoff, R. L., et al, Infect Immun. 61: 1293-1300 (1993); 
Newman, J. V., etal, FEMS Microbiol Lett 722:281-287 (1994); Ritter, A., et 
al, Mol Microbial, 25:871-882 (1997)). 
[0393] Furthermore, temperature, medium composition and other environmental 

factors were shown to influence the activity of FimB and FimE. Finally, a 
spontaneous, statistical switching of Xh&fimA promoter has been reported. The 
frequency of this spontaneous switching is approximately 10" 3 per generation 
(Eisenstein, B. I., Science 274:337-339 (1981); Abraham, S. M., etal, Proc. Nat 
Acad. Sci, USA 52:5724-5727 (1985)), but is strongly influenced by the above 
mentioned factors. 

[0394] The genes fiml and fimC are also transcribed from the fimA promoter, but 

directly downstream of fimA a DNA segment with a strong tendency to form 
secondary structure was identified which probably represents a partial 
transcription terminator (Klemm, P., Euro. J. Biochem. 743:395-400 (1984)); and 
is therefore supposed to severely reduce//ra/ and fimC transcription. At the 3' 
end of fimC an additional promoter controls the fimD transcription; at the 3 ' end 
of fimD the last known flm promoter is located that regulates the levels of FimF, 
FimG, and FimH. Thus, all of the minor Type-1 pili proteins are transcribed as a 
single mRNA (Klemm, P. & Krogfelt, K. A., "Type I fimbriae of Escherichia 
coli," in: Fimbriae. Klemm, P. (ed.), CRC Press Inc., (1994) pp. 9-26). This 
ensures a 1 : 1 : 1 stochiometry on mRNA-level, which is probably maintained on the 
protein level. 
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[0395] In the case of P-pili additional regulatory mechanisms were found when 

the half-life of mRNA was determined for different P-pilus genes. The mRNA for 
papA was extraordinarily long-lived, whereas the mRNA for papB, a regulatory 
pilus protein, was encoded by short-lived mRNA (Naureckiene, S . & Uhlin. BE., 
Mol Microbiol 27:55-68 (1996); Nilsson, P., etal, J. Bacterial 775:683-690 
(1996)). 

[0396] In the case of Type- 1 pili, the gene for the Type-1 pilus chaperone FimC 

starts with a GTG instead of an ATG codon, leading to a reduced translation 
efficiency. Finally, analysis of the fimH gene revealed a tendency of the fimH 
mRNA to form a stem-loop, which might severely hamper translation. In 
summary, bacterial pilus biogenesis is regulated by a wide range of different 
mechanisms acting on all levels of protein biosynthesis. 

[0397] Periplasmic pilus proteins are generally synthesized as precursors, 

containing aN-terminal signal-sequence that allows translocation across the inner 
membrane via the Sec-apparatus. After translocation the precursors are normally 
cleaved by signal-peptidase I. Structural Type-1 pilus subunits normally contain 
disulfide bonds, their formation is catalyzed by DsbA and possibly DsbC and 
DsbG gene products. 

[0398] The Type- 1 pilus chaperone FimC lacks cysteine residues. In contrast, the 

chaperone of P-pili, PapD, is the only member of the pilus chaperone family that 
contains a disulfide bond, and the dependence of P-pili on DsbA has been shown 
explicitly (Jacob-Dubuisson, F., etal, Proc. Nat. Acad. Sci. USA 91 : 1 1552-1 1556 
(1994)). PapD does not accumulate in the periplasm of zAdsbA strain, indicating 
that the disturbance of the P-pilus assembly machinery is caused by the absence 
of the chaperone (Jacob-Dubuisson, F., et al, Proc. Nat. Acad. Set USA 
91:1 1552-1 1556 (1994)). This is in accordance with the finding that Type-1 pili 
are still assembled m&AdsbA strain, albeit to reduced level (Hultgren, S. J., et al., 
"Bacterial Adhesion and Their Assembly", in: Escherichia coli and Salmonella, 
Neidhardt, F. C. (ed.) ASM Press, (1996) pp. 2730-2756). 
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[0399] Type-1 pili as well as P-pili are to 98% made of a single or main structural 

subunit termed FimA and PapA, respectively. Both proteins have a size of -15.5 
kDa. The additional minor components encoded in the pilus gene clusters are very 
similar (see Table 1). The similarities in sequence and size of the subunits with the 
exception of the adhesins suggest that all share an identical folding motif, and 
differ only with respect to their affinity towards each other. Especially the N- and 
C-terminal regions of these proteins are well conserved and supposed to play an 
important role in chaperone/subunit interactions as well as in subunit/subunit 
interactions within the pilus (Soto, G. E. & Hultgren, S. J., J. Bacteriol 
181: 1059-1071 (1999)). Interestingly, the conserved N-terminal segment can be 
found in the middle of the pilus adhesins, indicating a two-domain organization of 
the adhesins where the proposed C-terminal domain, starting with the conserved 
motif, corresponds to a structural pilus subunit whereas the N-terminal domain 
was shown to be responsible for recognition of host cell receptors (Hultgren, S. 
J., etal, Proa Nat Acad Set USA §6:4357-4361 (1989); Haslam, D. R., etal, 
Mol Microbiol 74:399-409 (1994); Soto, G. E., etal., EMBOJ. 77:6155-6167 
(1998)). The different subunits were also shown to influence the morphological 
properties of the pili. The removal of several genes was reported to reduce the 
number of Type-1 or P-pili or to increase their length, (fimH, papG, papK,fimF f 
fimG) (Russell, P. W. & Orndorff, P. E., J. Bacteriol 774:5923-5935 (1992); 
Jacob-Dubuisson, R., et al, EMBO J. 72:837-847 (1993); Soto, G. E. & 
Hultgren, S. I, J. Bacteriol 757:1059-1071 (1999)); combination of the gene 
deletions amplified these effects or led to a total loss of pilation (Jacob-Dubuisson, 
R., etal, EMBO J. 72:837-847 (1993)). 

[0400] In non-fimbrial adhesive cell organelles also assembled via 

chaperones/usher systems such as Myf fimbriae and CS3 pili, the conserved C~ 
terminal region is different. This indirectly proves the importance of these C- 
terminal subunit segments for quaternary interactions (Hultgren, S. J., et al., 
"Bacterial Adhesion and Their Assembly", in: Escherichia coli and Salmonella, 
Neidhardt, F. C. (ed.) ASM Press, (1996) pp. 2730-2756). 



WO 01/85208 



PCT/IB01/00741 



-124- 



[0401] Gene deletion studies proved that removal of the pilus chaperones leads 

to a total loss of piliation in P-pili and Type-1 pili (Lindberg, F., et aL, J. 
Bacterial. 777:6052-6058 (1989); Klemm, P., Res. Microbiol. 743:831-838 
(1992); Jones, C. H., et aL, Proc. Nat. Acad Sci. USA 90:8397-8401 (1993)). 
Periplasmic extracts of a AfimC strain showed the accumulation of the main 
subunit FimA, but no pili could be detected (Klemm, P., Res. Microbiol. 743:83 1- 
838 (1992)). Attempts to over-express individual P-pilus subunits failed and only 
proteolytically degraded forms could be detected in the absence of PapD; in 
addition, the P-pilus adhesin was purified with the inner membrane fraction in the 
absence of the chaperone (Lindberg, F., et al., J. Bacteriol. 777:6052-6058 
(1989)). However, co-expression of the structural pilus proteins and their 
chaperone allowed the detection of chaperone/subunit complexes from the 
periplasm in the case of the FimC/FimH complex as well as in the case of different 
Pap-proteins including the adhesin PapG and the main subunit Pap A (Tewari, R., 
et al, J. Biol. Chem. 255:3009-3015 (1993); Lindberg, F., et al, J. Bacteriol 
777:6052-6058 (1989)). The affinity of chaperone/subunit complexes towards 
their assembly platform has also been investigated in vitro and was found to differ 
strongly (Dodson etal, Proc. Natl Acad. Sci. USA 90361 0-361 A (1993)). From 
these results the following functions were suggested for the pilus chaperones: 

[0402] They are assumed to recognize unfolded pilus subunits, prevent their 

aggregation and to provide a "folding template" that guides the formation of a 
native structure. 

[0403] The folded subunits, which after folding display surfaces that allow 

subunit/subunit interactions, are then expected to be shielded from interacting with 
other subunits, and to be kept in a monomeric, assembly-competent state. 

[0404] Finally, the pilus chaperones are supposed to allow a triggered release of 

the subunits at the outer membrane assembly location, and, by doing so with 
different efficiency, influence the composition and order of the mature pili (see 
also the separate section below). 
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[0405] After subunit release at the outer membrane, the chaperone is free for 

another round of substrate binding, folding assistance, subunit transport through 
the periplasm and specific delivery to the assembly site. Since the periplasm lacks 
energy sources, like ATP, the whole pilus assembly process must be 
thermodynamically driven (Jacob-Dubuisson, ¥.,etaL, Proc. Nat Acad. Set USA 
91: 1 1552-1 1556 (1994)). The wide range of different functions attributed to the 
pilus chaperones would implicate an extremely fine tuned cascade of steps. 

[0406] Several findings, however, are not readily explained with the model of 

pilus chaperone function outlined above. One example is the existence of 
multimeric chaperone/subunit complexes (Striker, R. T., et aL, J. Biol Chem. 
2(59:12233-12239 (1994)), where one chaperone binds subunit dimers or trimers. 
It is difficult to imagine a folding template that can be "double-booked". The 
studies on the molecular details of chaperone/subunit interaction (see below) 
partially supported the functions summarized above, but also raised new 
questions. 

[0407] All 3 1 periplasmic chaperones identified by genetic studies or sequence 

analysis so far are proteins of approximately 25 kDa with conspicuously high pi 
values around 10. Ten of these chaperones assist the assembly of rod-like pili, 
four are involved in the formation of thin pili, ten are important for the biogenesis 
of atypically thin structures (including capsule-like structures) and two adhesive 
structures have not been determined so far (Holmgren, A., et aL, EMBO J. 
77:1617-1622 (1992); Bond, A, et aL, J. MoL Evolution 44:299-309 (1997); 
Smyth, C. J., et aL, FEMSImmun. MedMicrobioL 7(5:127-139 (1996); Hung, D. 
L. &Hultgren, S. J., J. Struct, Biol. 124:201-220 (1998)). The pairwise sequence 
identity between these chaperones and PapD ranges from 25 to 56%, indicating 
an identical overall fold (Hung, D. L., etaL, EMBO J. 75:3792-3805 (1996)). 

[0408] The first studies on the mechanism of chaperone/substrate recognition was 

based on the observation that the C-termini of all known pilus chaperones are 
extremely similar. Synthetic peptides corresponding to the C-termini of the P- 
pilus proteins were shown to bind to PapD in ELISA assays (Kuehn, M. J., et aL, 
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Science 262:1234-1241 (1993)). Most importantly, the X-ray structures of two 
complexes were solved in which PapD was co-crystallized with 19-residue 
peptides corresponding to the C-termini of either the adhesin PapG or the minor 
pilus component PapK (Kuehn, M. J., et al t Science 252:1234-1241 (1993); 
Soto, G. E., etal, EMBO J. 77:6155-6167 (1998)). Both peptides bound in an 
extended conformation to a P-strand in the N-terminal chaperone domain that is 
oriented towards the inter-domain cleft, thereby extending a (3-sheet by an 
additional strand. The C-terminal carboxylate groups of the peptides were 
anchored via hydrogen-bonds to Arg8 and Lysll2, these two residues are 
invariant in the family of pilus chaperones. Mutagenesis studies confirmed their 
importance since their exchange against alanine resulted in accumulation of non- 
functional pilus chaperone in the periplasm (Slonim, L. N., et al, EMBO J. 
77:4747-4756 (1992)). The crystal structure of PapD indicates that neither Axg8 
nor Lysll2 is involved in stabilization of the chaperone, but completely solvent 
exposed (Holmgren, A. & Branden, C. L, Nature 342:248-251 (1989)). On the 
substrate side the exchange of C-terminal PapA residues was reported to abolish 
P-pilus formation, and similar experiments on the conserved C-terminal segment 
of the P-pilus adhesin PapG prevented its incorporation into the P-pilus (Hultgren, 
S. J., et aL, "Bacterial Adhesion and Their Assembly", in: Escherichia coli and 
Salmonella, Neidhardt, F. C. (ed.) ASM Press, (1996) pp. 2730-2756). All 
evidence therefore indicated pilus subunit recognition via the C-terminal segments 
of the subunit s. 

[0409] A more recent study on C-terminal amino acid exchanges of the P-pilus 

adhesin PapG gave a more detailed picture. A range of amino acid substitutions 
at the positions -2, -4, -6, and -8 relative to the C-terminus were tolerated, but 
changed pilus stability (Soto, G. E., etal, EMBO X 77:6155-6167 (1998)). 

[0410] Still, certain problems arise when this model is examined more closely. 

Adhesive bacterial structures not assembled to rigid, rod-like pili lack the 
conserved C-terminal segments (Hultgren, S. J., et al., "Bacterial Adhesion and 
Their Assembly", in: Escherichia coli and Salmonella, Neidhardt, F. C. (ed.) 
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ASM Press, (1996) pp. 2730-2756), even though they are also dependent on the 
presence of related pilus chaperones. This indicates a different general role for the 
C-terminal segments of pilus subunits, namely the mediation of quaternary 
interactions in the mature pilus. Moreover, the attempt to solve the structure of 
a C-terminal peptide in complex with the chaperone by NMR was severely 
hampered by the weak binding of the peptide to the chaperone (Walse, B.,etaL, 
FEES Lett 472:115-120 (1997)); whereas an essential contribution of the C- 
terminal segments for chaperone recognition implies relatively high affinity 
interactions. 

[0411] An additional problem arises if the variability between the different 

subunits are taken into account. Even though the C-terminal segments are 
conserved, a wide range of conservative substitutions is found. For example, 1 5 
out of 1 9 amino acid residues differ between the two peptides co-crystallized with 
PapD (Soto, G. E., et al., EMBO J. 77:6155-6167 (1998)). This has been 
explained by the kind of interaction between chaperone and substrate, that occurs 
mainly via backbone interactions and not specifically via side-chain interactions. 
Then again, the specificity of the chaperone for certain substrates is not readily 
explained. On the contrary to the former argument, the conserved residues have 
been taken as a proof for the specificity (Hultgren, S. J., et al., "Bacterial 
Adhesion and Their Assembly", in: Escherichia coli and Salmonella, Neidhardt, 
F. C. (ed.) ASM Press, (1996) pp. 2730-2756). 

[0412] The outer membrane assembly platform, also termed "usher" in the 

literature, is formed by homo-oligomers of FimD or PapC, in the case of Type- 1 
and P-pili 3 respectively (Klemm, P. & Christiansen, G., Mol. Gen, Genetics 
220:334-338 (1990); Thanassi, D. G., etal, Proc. Nat. Acad. Set USA P5:3146~ 
3151 (1998)). Studies on the elongation of Type-1 fimbriae by electron 
microscopy demonstrated an elongation of the pilus from the base (Lowe, M. A., 
etal, J. Bacterial 7(59:157-163 (1987)). In contrast to the secretion of unfolded 
subunits into the periplasmic space, the fully folded proteins have to be 
translocated through the outer membrane, possibly in an oligomeric form 
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(Thanassi, D. G, etaL, Proc. Nat. Acad. Set USA £5:3146-3151 (1998)). This 
requires first a membrane pore wide enough to allow the passage and second a 
transport mechanism that is thermo dynamically driven (Jacob-Dubuisson, F., et 
al., J. Biol. Chem. 2(59:12447-12455 (1994)). 

[0413] FimD expression alone was shown to have a deleterious effect on bacterial 

growth, the co-expression of pilus subunits could restore normal growth behavior 
(Klemm, P. & Christiansen, G,Mo/. Gen, Genetics 220:334-338 (1990)). Based 
on this it can be concluded that the ushers probably form pores that are completely 
filled by the pilus. Electron microscopy on membrane vesicles in which PapC had 
been incorporated confirmed a pore-forming structure with an inner diameter of 
2 nm (Thanassi, D. G, etaL, Proc. Nat Acad. Set USA 95:3146-3151 (1998)). 
Since the inner diameter of the pore is too small to allow the passage of a pilus 
rod, it has been suggested that the helical arrangement of the mature pilus is 
formed at the outside of the bacterial surface. The finding that glycerol leads to 
unraveling of pili which then form a protein chain of approximately 2 nm is in 
good agreement with this hypothesis, since an extended chain of subunits might 
be formed in the pore as a first step (Abraham, S. N., et al., J. Bacteriol. 
774:5145-5148 (1992); Thanassi, D. G., et al, Proc. Nat. Acad. Set USA 
953 146-3 1 5 1 (1998)). The formation of the helical pilus rod at the outside of the 
bacterial membrane might then be the driving force responsible for translocation 
of the growing pilus through the membrane. 

[0414] It has also been demonstrated that the usher proteins of Type- 1 and P-pili 

form ternary complexes with chaperone/ subunit complexes with different affinities 
(Dodson, K. W., etaL, Proc. Nat. Acad. Sci. USA 90:3670-3674 (1993); Saulino, 
E. T., etaL, EMBO J. 77:2177-2185 (1998)). This was interpreted as "kinetic 
partitioning" that allows a defined order of pilus proteins in the pilus. Moreover, 
it has been suggested that structural proteins might present a binding surface only 
compatible with one other type of pilus protein; this would be another mechanism 
to achieve a highly defined order of subunits in the mature pilus (Saulino, E. T., 
etaL, EMBO J. 77:2177-2185 (1998)). 
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B. Production of Type- 1 pili from Escherichia coli 

[0415] E. coli strain W3110 was spread on LB (10 g/L tryptone, 5 g/L yeast 

extract, 5 g/L NaCl, pH 7.5, 1 % agar (w/v)) plates and incubated at 37°C 
overnight. A single colony was then used to inoculate 5 ml of LB starter culture 
(10 g/L tryptone, 5 g/L yeast extract, 5 g/L NaCl, pH 7.5). After incubation for 
24 hours under conditions that favor bacteria that produce Type-1 pili (37°C, 
without agitation) 5 shaker flasks containing 1 liter LB were inoculated with one 
milliliter of the starter culture. The bacterial cultures were then incubated for 
additional 48 to 72 hours at 37°C without agitation. Bacteria were then harvested 
by centrifugation (5000 rpm, 4°C, 10 minutes) and the resulting pellet was 
resuspended in 250 milliliters of 10 mM Tris/HCl, pH 7.5. Pili were detached 
from the bacteria by 5 minutes agitation in a conventional mixer at 17.000 rpm. 
After centrifugation for 10 minutes at 10,000 rpm at 4°C the pili containing 
supernatant was collected and 1 M MgC12 was added to a final concentration of 
100 mM. The solution was kept at 4°C for 1 hour, and the precipitated pili were 
then pelleted by centrifugation (10,000 rpm, 20 minutes, 4°C). The pellet was 
then resuspended in 10 mM HEPES, pH 7.5, and the pilus solution was then 
clarified by a final centrifugation step to remove residual cell debris. 

C. Coupling of FLAG to purified Type-1 pili of E. coli using m- 
Maleimidonbenzoyl-N-hydroxysulfosuccinimide ester (sulfo-MBS) 

[0416] 600 (il of a 95% pure solution of bacterial Type-1 pili (2 mg/ml) were 

incubated for 30 minutes at room temperature with the heterobifunctional 
cross-linker sulfo-MBS (0.5 mM). Thereafter, the mixture was dialyzed overnight 
against 1 liter of 50 mM Phosphate buffer (pH7,2) with 150mMNaClto remove 
free sulfo-MBS. Then 500 |il of the derivatized pili (2 mg/ml) were mixed with 
0.5 mM FLAG peptide (containing an amino-terminal Cysteine) in the presence 
of 10 mM EDTA to prevent metal-catalyzed sufhydryloxidation. The non- 
coupled peptide was removed by size-exclusion-chromatography. 
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[0417] Figure 9 depicts an analysis of coupling of the FLAG peptide to type-1 

bacterial pili by SDS-P AGE. Lane 1 shows the unreacted pili subunitFimA. Lane 
3 shows the purified reaction mixture of the pili with the FLAG peptide. The 
upper band corresponds to the coupled product, while the lower band corresponds 
to the unreached subunit. 

EXAMPLE 34 
Construction of an expression plasmid for 
the expression of Type- 1 pili of Escherichia coli 

[0418] The DNA sequence disclosed in GenBank Accession No. U14003, 

the entire disclosure of which is incorporated herein by reference, contains all of 
the Escherichia coli genes necessary for the production of type-1 pili from 
nucleotide number 233947 to nucleotide number 240543 (the fim gene cluster). 
This part of the sequences contains the sequences for the genes fimA,fimI 7 fimC, 
firnD, fimV, fimG, and firnH. Three different PCRs were employed for the 
amplification of this part of the E. coli genome and subsequent cloning into 
pUC19 (GenBank Accession Nos. L09137 and X02514) as described below. 

[0419] The PGR template was prepared by mixing 10 ml of a glycerol stock of the 

E. coli strain W3110 with 90 ml of water and boiling of the mixture for 10 
minutes at 95 °C, subsequent centrifugation for 10 minutes at 14,000 rpm in a 
bench top centrifuge and collection of the supernatant. 

[0420] Ten ml of the supernatant were then mixed with 50 pmol of a PCR primer 

one and 50 pmol of a PCR primer two as defined below. Then 5 ml of a 1 OX PCR 
buffer, 0.5 ml of Taq-DNA-Polymerase and water up to a total of 50 ml were 
added. All PCRs were carried out according to the following scheme: 94°C for 
2 minutes, then 30 cycles of 20 seconds at 94°C, 30 seconds at 55°C, and 2 
minutes at 72°C. The PCR products were then purified by 1% agarose gel- 
electrophoresis. 

[0421] Oligonucleotides with the following sequences with were used to amplify 

the sequence from nucleotide number 233947 to nucleotide number 235863, 
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comprising the Jim A, fim\ and JimC genes: TAGATGATTACGCCAAGC 
TTATAATAGAAATAGTTTTTTGAAAGGAAAGCAGCATG (SEQ ID 
NO: 196) and GTCAAAGGCCTTGTCGACGTTATTCCATTACGCCCGTC 
ATTTTGG (SEQ ID NO: 197). 

[0422] These two oligonucleotides also contained flanking sequences that allowed 

for cloning of the amplification product into pucl 9 via the restriction sites HindLII 
and Sail. The resulting plasmid was termed pFIMAIC (SEQ ID NO: 198). 

[0423] Oligonucleotides with the following sequences with were used to amplify 

the sequence from nucleotide number 235654 to nucleotide number 238666, 
comprising the JimD gene: AAGATCTTAAGCTAAGCTTGAATTCTC 
TGACGCTGATTAACC (SEQ ID NO: 199) and ACGTAAAGCATTTCT 
AGACCGCGGATAGTAATCGTGCTATC (SEQ ID NO:200). 

[0424] These two oligonucleotides also contained flanking sequences that allowed 

for cloning of the amplification product into puc 1 9 via the restriction sites Hindi!! 
and Xbal, the resulting plasmid was termed pFIMD (SEQ ID NO:201). 

[0425] Oligonucleotides with the following sequences with were used to amplify 

the sequence from nucleotide number 238575 nucleotide number 240543, 
comprising the fwiF, JimG, and JimU gene: AATTACGTGAGCA 
AGCTTATGAGAAACAAACCTTTTTATC (SEQ IDNO:202) and GACTAAG 
GCCTTTCTAGATTATTGATAAACAAAAGTCACGC (SEQ ID NO.203). 

[0426] These two oligonucleotides also contained flanking sequences that allowed 

for cloning of the amplification product into puc 1 9 via the restriction sites Hindlll 
and Xbal\ the resulting plasmid was termed pFIMFGH. (SEQ ID NO:204). 

[0427] The following cloning procedures were subsequently carried out to 

generate a plasmid containing all the above-mentioned/zm-genes: pFIMAIC was 
digested EcoBl and Hindlll (2237-3982), pFIMD was digested EcoRl and Sstll 
(2267-5276), pFIMFGH was digested Sstll and Hindlll (2327-2231). The 
fragments were then ligated and the resulting plasmid, containing all the //m-genes 
necessary for pilus formation, was termed pFIMAICDFGH (SEQ ID NO:205). 
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EX AMPLE 35 

Construction of an expression plasmid for Escherichia coli type-1 pili that lacks 

the adhesion FimH 

[0428] The plasmid pFIMAICDFGH (SEQ ID NO :205) was digested with Kpnl, 

after which a fragment consisting of nucleotide numbers 8895-8509 was isolated 
by 0.7% agarose gelelectrophoresis and circularized by self-ligation. The resulting 
plasmid was termed pFIMAICDFG (SEQ ID NO: 206), lacks the fimH gene and 
can be used for the production of FIMH-free type-1 pili. 

EXAMPLE 36 
Expression of type-1 pili using the plasmid pFIMAICDFGH 
[0429] E. coli strain W3110 was transformed with pFIMAICDFGH (SEQ ID 

NO:205) and spread on LB (10 g/L tryptone, 5 g/L yeast extract, 5 g/L NaCl, pH 
7.5, 1 % agar (w/v)) plates containing 100 |ug/ml ampicillin and incubated at 37 °C 
overnight. A single colony was then used to inoculate 50 ml of LB-glucose starter 
culture (10 g/L tryptone, 5 g/L yeast extract, 1% (w/v) glucose, 5 g/L NaCl, pH 
7.5,1 OOmg/ml ampicillin) . After incubation for 12-16 hours at37°Catl50 rpm, 
a 5 liter shaker flasks containing 2 liter LB-glucose was inoculated with 20 
milliliter of the starter culture. The bacterial cultures were then incubated for 
additional 24 hours at 37 °C with agitation (150 rpm). Bacteria were then 
harvested by centrifugation (5000 rpm, 4°C, 10 minutes) and the resulting pellet 
was resuspended in 250 milliliters of 10 mM Tris/HCl, pH 8. Pili were detached 
from the bacteria by agitation in a conventional mixer at 1 7,000 rpm for 5 minutes. 
After centrifugation for 10 minutes at 10,000 rpm, 1 hour, 4°C the supernatant 
containing pili was collected and 1 M MgCl 2 was added to a final concentration 
of 100 mM. The solution was kept at 4 °C for 1 hour, and precipitated pili were 
then pelleted by centrifugation (10,000 rpm, 20 minutes, 4°C). The pellet was 
then resuspended in 10 mM HEPES, 30 mM EDTA, pH 7.5, for 30 minutes at 
room temperature, and the pilus solution was then clarified by a final 
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centrifugation step to remove residual cell debris. The preparation was then 
dialyzed against 20 mM HEPES, pH 7.4. 

EXAMPLE 37 
Activation of HBcAg-Lys with SPDP 
[0430] HBcAg-Lys at a concentration of 1 5 \xM was reacted with SPDP at a 

concentration of 456 \iM SPDP for 60 minutes at room temperature, resulting in 
a thirty-fold excess of cross-linker over capsid subunit. The reaction mixture was 
subsequently loaded on SDS-PAGE for analysis, as shown in Fig. 10. The gel 
shows that the monomer subunits are cross-linked to dimers and higher-order 
polymers during the reaction. 

EXAMPLE 38 

Multimerization of HBcAg-Lys Upon Reaction With Sulfo-MBS 
[0431] HBcAg-Lys at a concentration of 1 1 8 jiM was reacted with 20 mM Sulfo- 

MBS for 30 minutes at room temperature. As shown in Fig. 1 1, analysis of the 
reaction mixture by SDS-PAGE revealed that the HBcAg-Lys monomers 
internally cross-linked to multimers, as reflected in the absence of a band 
corresponding to the subunit monomer after cross-linking. 

EXAMPLE 39 
Conjugation of HBcAg-Lys-2cys Mut to the FLAG Peptide 
[0432] HB c Ag-Ly s-2 cy s -Mut at a concentration of 80 |uM was reacted with sulfa- 

MBS at a concentration of 8.8 mM for 30 minutes at room temperature, resulting 
in a 1 10-fold excess of cross-linker over capsid subunit. The reaction mixture was 
precipitated two times with 50% ammoniumsulfate and resuspended in 20 mM 
Hepes, 150 mM NaCl, pH 7.4, in a volume equivalent to the reaction volume 
before precipitation. FLAG peptide containing an N-terminal cysteine was added 
at a concentration of 1.6 mM and the reaction was allowed to proceed for four 
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hours at room temperature. The reaction mixture was subsequently loaded on 
SDS-PAGE for analysis, and the coupling products are shown in Fig. 12. 

EXAMPLE 40 
Conjugation of Pili to the p33 Peptide 
[0433] A solution of 1 ml pili at a concentration of 1.5 mg/ml (concentration of 

the subunit) was reacted with 750 |ul of a 100 mM Sulfo-MBS solution in 20 mM 
Hepes, pH 7.4, for 45 minutes at room temperature. The reaction mixture was 
desalted over a Sephadex G25 column equilibrated with 20 mM Hepes, pH 7.4. 
Fractions containing pili protein were pooled after analysis by dot blot stained with 
amidoblack, and 0.6 |ul of a solution of 100 mM p33 peptide 
(CGGKAVYNFATM, SEQ ID NO: 175), containing anN-terminal cysteine, in 
DMSO was added to 100 jjiI of the desalted activated pili and reaction allowed to 
proceed for four hours at room temperature. The reaction mixture was 
subsequently analyzed by SDS-PAGE, as shown in Fig. 13. 

EXAMPLE 41 
Expression of HBcAg-Lys-2cys-Mut 
[0434] The plasmid coding for HBcAg~Lys-2cys-Mut was transformed into E. 

coli K802. A single colony was inoculated into 50 ml LB containing 100 mg/ml 
ampicillin. The next day, the overnight culture was diluted into 2 L LB medium 
containing 100 mg/ml ampicillin and grown until ID 600 = 0.6 at 37°C. Cells were 
induced with 1 mM IPTG, and grown for another 4 hours at 37 °C. The cells 
were then harvested, and the pellet resuspended in 5 ml of 10 mM Na 2 HP0 4 , 03 
mM NaCl, 10 mM EDTA, 0.25% Tween, pH 7.0. Cells were then disrupted by 
sonification, and ammoniumsulfate was added to a concentration of 20%. The 
pellet was resuspended in 3 ml PBS buffer, and loaded onto a Sephacryl S-400 
column. The protein peak containing the capsid protein corresponding to the size 
of assembled capsid was collected and loaded onto a hydoxyapatite column for 
subsequent purification. The protein was eluted in the paththrough fraction. 
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EXAMPLE 42 

Coupling of DPI 78c peptide, immunization of mice and determination 

of the IgG subtypes 

[0435] DP 178c peptide is a fragment of the gp41 protein of HIV virus (Kilby, 

J.M. etaL, Nature Medicine 4: 1302-07 (1998)); Wild, C. etaL, Aids Res. Hum. 
Retroviruses 9: 1051-53 (1993)). 

A. Coupling of DP 1 78c to Pili 

[0436] A solution of 3 ml Pili (2.5 mg/ml) produced as described in Example 33 

B was reacted with 500 \A of a 100 rnM Sulfo-MBS solution for 45 minutes at 
RT. The reaction mixture was desalted on a Sephadex G25 column equilibrated 
with 20 mM hepes pH 7.4, and fractions containing pili were pooled. An aliquot 
of 750 /A of the activated pili was diluted in 750 /A DMSO, and 2-5 jA of a 100 
mM DP 178c solution in DMSO was added. The reaction was left to react 4 hours 
at RT, and glucose was added to the reaction mixture to give a final concentration 
of 0.2%. This solution was then dialyzed against 20 mM Hepes, 0. 1% glucose, 
pH 7.4. The dialyzed coupled pili were centrifuged and loaded on SDS-PAGE for 
analysis. The result of the coupling reaction is depicted on Figure 14 A. The 
sequence of the DP 178c peptide (fragment of the HIV gp41 protein) is 
C YT SLIHSLIEESQNQQEKNEQELLELDKWASLWNWF (SEQ ID No : 176). 

B. Immunization of mice and IgG subtype determination 

[0437] 80 ,ug of Pili-DP178c was injected in saline intravenously into 

female Balb/c mice. These mice were boosted with the same amount of vaccine 
on day 14 and bled on day 24. DP178-specific IgG in serum was determined on 
day 24 in a DP 178 peptide specific ELISA (DP 178c peptide was conjugated to 
Ribonuclease A using the cross-linker SPDP). In Figure 14B, average results 
from two mice are shown as optical densities obtained with a 1 :50 dilution of the 
serum. 
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EXAMPLE 43 
Expression and purification of GRA2 polypeptide 
[0438] Gra2 is an antigen of Toxoplasma Gondii. The 59 c-terminal amino acids 

acids of GRA2 with a c-terminal linker of 6 amino acids (GSGGCG, SEQ ID No. 
1 77) were cloned into the pGEX-2T vector (Pharmacia, 27-480 1-01). Expression 
and purification of the GST-fusion protein was carried out as described in the 
instructions. GST was cleaved from GRA2 with thrombin while the fusion protein 
was bound to glutathione-sepharose-beads and the reaction stopped after 20 min. 
with 1 mM PMSF. The sepharose beads were then pelleted by centrifugation and 
the supernatant containing the GRA2-polypeptide was collected. The solution 
was then concentrated 10-fold with a Ultrafree-4 centrifugal filter-5K (Millipore, 
UFV4BCC25). To reduce disulfide bonds which might eventually have formed, 
the solution was treated with 20 mM DTT 1 h on ice. DTT was removed by 
loading the protein solution on a PD10 column (Pharmacia). Protein 
concentration was determined by the Lowry test and concentration of free 
cysteines in an Ellmann's test. The protein was subsequently analyzed by SDS- 
P AGE. The GRA2 protein can however not be detected by Commassie staining. 
A yield of 9 mg GRA2 was obtained from an 8 L culture. The GRA2 amino acid 
sequence is KEAAGRGMVT VGKKLANVES DRSTTTTQAP DSPNGLAETE 
VPVEPQQRAA HVPVPDFSQGSGGCG (SEQ ID No. 178) 

EXAMPLE 44 
Coupling of GRA2 to Pili 
A. Coupling of GRA2 to Pili. 
[0439] 6 ml of a 2.5 mg/ml Pili protein solution (produced as described in 

Example 33 B) were reacted with a 50 fold molar excess of Sulfo-MBS, and 
desalted over a PD10 column (Pharmacia), 1.5 ml of the reaction mixture were 
loaded on one column, 1 ml was added and the first 1.5 ml were collected. 
Fractions containing Pili were identified on a dot blot stained with amidoblack. 
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A 300 yug/ml solution of GRA2 was concentrated 100 fold, and 100 fA were 
reacted with 1 .2 ml of the desalted activated Pili solution for 4 hours at RT. The 
reaction mixture was then dialyzed against 21 of a 20 mMHepes, 150 mMNaCl, 
pH 7.2 overnight. Figure 15A shows an analysis of the coupling reaction. 

B. Immunization of mice with Pili-GRA2 and IgG subtype determination. 
[0440] Mice, were immunized with 50 4g of Pili-GRA2 and boosted on day 

14,vith the same amount of vaccine. Serum samples we're taken on day 0,6,14 
and 21 after the first immunization. GRA2 specific IgG in serum was determined 
on day 21 in a GRA2 specific ELISA. Results of two individual mice in each 
group are shown in Figure 15B. The titer was determined as the dilution of sera 
resulting in half-maximal optical density (OD 50 ). 



EXAMPLE 45 
Coupling of B2- and D2-peptide to Pili 
[0441] D2 and B2 peptides are sequences from the OmpC protein of Salmonella 

typhi. It is an outer membrane porin. High level of antiporin antibodies have been 
detected in the sera of patients with typhoid fever (Arocklasamy, A. and 
Krishnaswamy, S., FEES Letters 453: 380-82 (1999)). 

A. Coupling of B2- or D2-peptides of the ompC protein of Salmonella typhi 
to Pili 

[0442] 6 ml of a 2.5 mg/ml Pili protein solution (produced as described in . 

Example 33 B) were reacted with a 50 fold molar excess of Sulfo-MBS, and 
desalted over a PD10 column (Pharmacia). 1.5 ml of the reaction mixture were 
loaded on one column, 1 ml was added, and the first 1.5 ml were collected. 
Fractions containing Pili were identified on a dot blot stained with amidoblack. 
An aliquot of 5 //l of a 100 raM solution of peptide was reacted with 2.6 ml of the 
desalted activated Pili solution for 4 hours at RT. The reaction mixture was then 
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dialyzed against 21 of a 20 mM Hepes, 1 50 mM NaCl, pH 7.2 overnight. Figure 
16A shows an analysis of the coupling reaction. The sequence of the D2 peptide 
is CGG TSN GSN PST SYG FAN (SEQ ID No. 179). The sequence of the B2 
peptide is CGG DIS NGY GAS YGD NDI (SEQ ID No. 180). 

B. Immunization of mice with Pili-B2 and IgG subtype determination. 
[0443] Mice were immunized interaperitoneally in female Balb/c mice with 5 0 yug 

of Pili-B2 in saline and boosted on day 14 with the same amount of vaccine, and 
bled on day 33. B2-peptide specific IgG in serum was determined on day 33 in 
a B2-specific ELISA (B2 peptide was conjugated to Ribonuclease A with the 
cross-linker SPDP). Average of the results of two individual mice are shown in 
Figure 16B. 

EXAMPLE 46 

[0444] The muTNFa peptide, comprising amino acids 22-3 3 of TNFa protein was 

coupled to Pili as described in Example 42, except that no glucose was 
addedduring the final dialysis step, where the reaction solution was dialyzed 
against 20 mM Hepes, pH 7.4 only. Two Balb/c female mice, 8 days of age were 
immunized intravenously with 100 yg of Pili-muTNFa each. These mice were 
boosted at day 14 with the same amount of vaccine, and bled on day 20. IgG 
specific for native TNFa protein in serum was detected at day 20 in an ELISA. 
As a control, preimmune sera of two mice were assayed for binding to TNFa 
protein. See Figure 17. The sequence of the muTNFa peptide was 
CGGVEEQLEWLSQR (SEQ ID No. 181). 



EXAMPLE 47 

A.Preparation of bacterial type-1 pili coupled to TNF peptides 
[0445] Two peptides comprising murine TNFa sequences were designed. 

Peptide 3' murine TNFa II (3'-TNFaII) was S S QNS SDKP VAHVVANHGVGGC 
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(SEQ ID No. 182). Peptide 5' murine TNFa II (5* TNFa II) was 
CSSQNSSDKPVAHWANHGV (SEQ ID No. 183). The peptides 5 '-TNFa II 
and 3-TNFa II were coupled to bacterial type-1 pili as follows. An aliquot of 1 
ml of a Pili solution (2.5 mg/ml) was reacted with 503 jA of a 100 mM Sulfo-NMS 
solution for 45 minutes at RT. The reaction mixture was desalted over a desalting 
column previously saturated with Pili protein and equilibrated in 20 mM Hepes, 
pH 7.4. The fractions containing protein were pooled. Art aliquot of 1 ml of 
desalted Pili was mixed with 1.56 /A of peptide (100 mM in DMSO) ? and the 
reaction left to proceed for 4 hours at RT. The reaction solution was then 
dialyzed overnight against 20 mM Hepes, 1 50 mMNaCl, pH 7.4 in the cold. See 
Figure 18 A. 

B . Immunization and detection of antibodies specific for native TNFa and the 
3" TNFII and 5' TNFII peptides 
[0446] Balb/ c mice were vaccinated intraperitoneally with 3 0 /zg protein in saline, 

on day 0 3 14 and 33. IgG antibodies specific for native TNFa protein (Fig. 18B) 
and for the 3' TNFII and 5' TNFII peptides (Fig. 1 8C) were measured in a specific 
ELISA. 

1 . Native TNFa ELISA 

[0447] 2 Mg/ml native TNFa protein was coated on ELISA plates. Sera were 

added at different dilutions and bound IgG was detected with a horseradish 
peroxidase-conjugated anti-murine IgG antibody. Results from four individual 
mice are shown on day 21 and day 43. 

2. Anti peptide ELISA 

[0448] IgG antibodies specific for the 3' TNFII and 5 f TNFII peptides 

were measured in a specific ELISA 1 0 ug/ml Ribonuclease A coupled to 3' TNFII 
or 5'TNFII peptide was coated on ELISA plates. Sera were added at different 
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dilutions and bound IgG was detected with a horseradish peroxidase-conjugated 
anti-murine IgG antibody. Results from four individual mice are shown on day 2 1 . 

C. Analysis of sera from mice immunized under B.: IgG subtype 
determination 

[0449] Sera from the immunized mice described under B. were taken on 

day 50. Antibodies specific for the TNF peptides described under A. were 
measured in a specific ELISA on day 50. RNAse coupled to the corresponding 
TNF peptide was coated on ELISA plates at a concentration of 10 (ig/ml. Sera 
were added at different dilutions and bound antibody was detected with horse 
radish peroxidase-conjugated anti-murine antibodies. See Figure 18D. 

EXAMPLE 48 

Coupling of Pili to M2 peptide, immunization of mice, and IgG subtype 

determination 

[0450] M2 peptide was coupled to pili as described in Example 47. The peptide 

was reacted at a fivefold molar excess with the activated Pili. Female Balb/c mice 
were injected with 50 y% Pili-M2 in saline subcutaneously. Mice were boosted 
with the same amount of vaccine on day 14 and bled on day 27, M2 specific IgG 
in serum was determined on day 27 in a M2-specific ELISA (peptide conjugated 
to Ribonuclease A with the cross-linker SPDP for coating) . S ee Figures 1 9 A and 
19B. 

EXAMPLE 49 

Immunization of mice with HbcAg-Lys-2cys-Mut coupled to the Flag peptide, 

and IgG subtype determination 
[0451] Flag peptide (SEQ ID NO: 147) was coupled to HBcAg-Lys-2cvs-Mut as 

described in Example 39. Two Balb/c mice were vaccinated intravenously with 
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50 fig HBc-Ag-Lys-2cys-Mut -Flag. On day 14 mice were boosted with the same 
amount of vaccine and bled on day 40, Flag-specific antibodies (Flag peptide was 
conjugated to Ribonuclease A with the cross-linker SPDP for coating) in serum 
were measured on day 40 in a specific ELIS A. ELISA plates were coated with 
10 fzg /ml RNAse coupled to Flag peptide and serum was added at a 1 :40 dilution. 
Bound antibodies were detected with peroxidase conjugate isotype-specific IgG. 
Results from the two mice are shown as ELISA titers in Figure 20. 



EXAMPLE 50 
Purification of Type- 1 Pili of Eschericia coli 

[0452] Isolated Type-1 pili of Eschericia coli prepared as described in Example 

33B were precipitated with ammonium sulfate, added to a final concentration of 
0.5 M, at 4°C for 30 minutes. The pili were then pelleted by centrifugation at 
20,000 rpm for 15 min at 4°C and the pellet was resuspended in 25 ml of 20 niM 
HEPES buffer, pH 7.3. The precipitation step was repeated once, and the final 
sample was resuspended in 9 ml of 20 niM HEPES, pH 7.3 and finally dialyzed 
against the same buffer to remove residual ammonium sulfate. The pili were 
subsequently purified on an SR-400 size exclusion chromatography column (20 
mM HEPES, pH 7.3) and the pili containing fractions were collected and pooled. 

[0453] All patents and publications referred to herein are expressly incorporated 

by reference. 

[0454] The entire disclosure of U.S. Application No. 09/449,631, filed 

November 30, 1999, is herein incorporated by reference. All publications and 
patents mentioned hereinabove are hereby incorporated in their entireties by 
reference. 



WO 01/85208 



PCT/IB01/00741 



v -142- 

WHAT IS CLAIMED IS: 

1 . A composition comprising a bacterial pilus to which an antigen or 
antigenic determinant has been attached by a covalent bond. 

2. The composition of claim 1, wherein said covalent bond is not a 
peptide bond. 

3 . The composition of claim 1 , wherein said bacterial pilus is a Type- 1 
pilus of Escherichia colL 

4. The composition of claim 1, wherein pilin subunits of said Type-1 
pilus comprises the amino acid sequence shown in SEQ ID NO: 146 or a sequence 
having at least 65, 70, 75, 80, 85, 90 or 95% sequence identity to SEQ ID 
NG.146. 

5. The composition of claim 1, wherein said bacterial pilus and said 
antigen' or antigen determinant are attached via a non-naturally occurring 
attachment. 

6. The composition of claim 1 , wherein said attachment comprises an 
organizer comprising at least one first attachment site, and wherein said organizer 
is connected to said pilus by at least one covalent bond. 

7. The composition of claim 6, wherein said organizer is a 
polypeptide or a residue thereof, and wherein said second attachment site is a 
polypeptide or a residue thereof. 

8. The composition of claim 6, wherein said first and/or a second 
attachment sites comprise: 
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(a) an antigen and an antibody or antibody fragment thereto; 

(b) biotin and avidin; 

(c) strepavidin and biotin; 

(d) a receptor and its ligand; 

(e) a ligand-binding protein and its ligand; 

(f) interacting leucine zipper polypeptides; 

(g) an amino group and a chemical group reactive thereto; 

(h) a carboxyl group and a chemical group reactive thereto; 

(i) a sulfhydryl group and a chemical group reactive thereto; 

or 

(j) a combination thereof. 



9. The composition of claim 1, wherein said bacterial pilus and said 
antigen or antigentic derminant are attached by an attachment comprising 
interacting leucine zipper polypeptides. 

10. The composition of claim 5, wherein interacting leucine zipper 
polypeptides are JUN and/or FOS leucine zipper polypeptides. 

11. A composition comprising a bacterial pilin polypeptide to which 
an antigen or antigenic determinant has been attached by a covalent bond. 

12. The composition of claim 11, wherein said covalent bond is not a 
peptide bond. 

13. The composition of claim 1 1, wherein said polypeptide is from a 
Type-1 pilus of Escherichia coli. 
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14. The composition of claim 11, wherein said bacterial pilin 
polypeptide comprises the amino acid sequence shown in SEQ ID NO: 146 or a 
sequence having at least 65, 70, 75, 80, 85, 90 or 95% sequence identity to SEQ 
ID NO: 146. 

15. The composition of claim 11, wherein said bacterial pilin 
polypeptide and said antigen or antigenic determinant are attached by a non- 
naturally occurring attachment. 

16. The composition of claim 1 1, wherein said attachment comprises 
an organizer comprising at least one first attachment site, and wherein said 
organizer is connected to said pilus by at least one covalent bond. 

17. The composition of claim 16, wherein said organizer is a 
polypeptide or a residue thereof, and wherein said second attachment site is a 
polypeptide or a residue thereof. 

18. The composition of claim 11, wherein said first and/or a second 
attachment sites comprise: 



(a) 


an antigen and an antibody or antibody fragment thereto; 


(b) 


biotin and avidin; 


(c) 


strepavidin and biotin; 


(d) 


a receptor and its ligand; 


(e) 


a ligand-binding protein and its ligand; 


(f) 


interacting leucine zipper polypeptides; 


(g) 


an amino group and a chemical group reactive thereto; 


(h) 


a carboxyl group and a chemical group reactive thereto; 


(i) 


a sulfhydryl group and a chemical group reactive thereto; 


(i) 


a combination thereof. 
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19. The composition of claim 15, wherein said attachment comprises 
interacting leucine zipper polypeptides. 

20. The composition of claim 13, wherein said interacting leucine 
zipper polypeptides are JUN and/or FOS leucine zipper polypeptides. 

21 . A composition comprising: 

(a) a non-natural molecular scaffold comprising: 

(i) a core particle selected from the group consisting 

of: 

(1) a bacterial pilus or pilin protein; and 

(2) a recombinant form of a bacterial pilus or 

pilin protein; and 

(ii) an organizer comprising at least one first attachment 

site, 

wherein said organizer is connected to said core particle by at least one 
covalent bond; and 

(b) an antigen or antigenic determinant with at least one second 
attachment site, said second attachment site being selected from the group 
consisting of: 

(i) an attachment site not naturally occurring with said 
antigen or antigenic determinant; and 

(ii) an attachment site naturally occurring with said 
antigen or antigenic determinant, 

wherein said second attachment site is capable of association through at 
least one non-peptide bond to said first attachment site; and 

wherein said antigen or antigenic determinant and said scaffold interact 
through said association to form an ordered and repetitive antigen array. 
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22. The composition of claim 21, wherein said organizer is a 
polypeptide or residue thereof; and wherein said second attachment site is a 
polypeptide or residue thereof. 



23 . The composition of claim 2 1 , wherein said first and/or said second 
attachment sites comprise: 

(a) an antigen and an antibody or antibody fragment thereto; 

(b) biotin and avidin; 

(c) strepavidin and biotin; 

(d) a receptor and its ligand; 

(e) a ligand-binding protein and its ligand; 

(f) interacting leucine zipper polypeptides; 

(g) an amino group and a chemical group reactive thereto; 

(h) a carboxyl group and a chemical group reactive thereto; 

(i) a sulfhydryl group and a chemical group reactive thereto; 

or 

(j) a combination thereof 



24. The composition of claim 2 1 , wherein said first and/or said second 
attachment sites comprise interacting leucine zipper polypeptides. 

25 . The composition of claim 2 1 , wherein said bacterial pilus is a Type- 
1 pilus of Eschericia coli. 

26 . The composition of claim 2 1 , wherein pilus subunits of said type- 1 
pilus comprise the amino acid sequence of SEQ ID No. 146 or a sequence having 
at least 65, 70, 75, 80, 85, 90 or 95% sequence identity to SEQ ID NO: 146. 

27. The composition of claim 26, wherein said interacting leucine 
zipper polypeptides are the JUN and/or FOS leucine zipper polypeptides. 
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28. A composition comprising: 

(a) a non-natural molecular scaffold comprising: 

(i) a virus-like particle that is a dimer or a multimer of 
a polypeptide comprising amino acids 1-147 of SEQ ID NO: 158 as core particle 
or a sequence having at least 65, 70, 75, 80, 85, 90 or 95% sequence identity to 
SEQ ID NO: 158; and 

(ii) an organizer comprising at least one first attachment 

site, 

wherein said organizer is connected to said core particle by at least one 
covalent bond; and 

(b) an antigen or antigenic determinant with at least one second 
attachment site, said second attachment site being selected from the group 
consisting of: 

(i) an attachment site not naturally occurring with said 
antigen or antigenic determinant; and 

(ii) an attachment site naturally occurring with said 
antigen or antigenic determinant, 

wherein said second attachment site is capable of association through at 
least one non-peptide bond to said first attachment site; and 

wherein said antigen or antigenic determinant and said scaffold interact 
through said association to form an ordered and repetitive antigen array. 

29. The composition of claim 28, wherein said organizer is a 
polypeptide or residue thereof; and wherein said second attachment site is a 
polypeptide or residue thereof. 

30. The composition of claim 28, wherein said first and/or said second 
attachment sites comprise: 

(a) an antigen and an antibody or antibody fragment thereto; 

(b) biotin and avidin; 
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(c) strepavidin and biotin; 

(d) a receptor and its ligand; 

(e) a ligand-binding protein and its ligand; 

(f) interacting leucine zipper polypeptides; 

(g) an amino group and a chemical group reactive thereto; 

(h) a carboxyl group and a chemical group reactive thereto; 

(i) a sulfhydryl group and a chemical group reactive thereto; 

or 

(j) a combination thereof. 



3 1 . The composition of claim 30, wherein said first attachment site is 
an amino group and said second attachment site is a sulfhydryl group. 



32. The composition of claim 30, wherein said virus-like particle and 
said antigen or antigenic determinant are attached by an attachment comprising 
interacting leucine zipper polypeptides. 

33. The composition of claim 32, wherein said interacting leucine 
zipper polypeptides are JUN and/or FOS FOS polypeptides. 



34. A composition comprising: 

(a) a non-natural molecular scaffold comprising: 

(i) Hepatitis B virus capsid protein comprising an 
amino acid sequence selected from the group consisting of: 

(1) the amino acid sequence of SEQIDNO:89; 

(2) the amino acid sequence of SEQ ID NO : 90; 
(3 ) the amino acid sequence of SEQ ID NO : 93 ; 

(4) theamino acid sequence of SEQ ID NO: 98; 

(5) the amino acid sequence of SEQ ID NO: 99; 
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102; 



(6) the amino acid sequence of SEQ ID NO: 

(7) the amino acid sequence of SEQ ID NO: 
104; 

(8) the amino acid sequence of SEQ ID 

(9) the amino acid sequence of SEQ ID 

(10) the amino acid sequence of SEQ ID 

(11) the amino acid sequence of SEQ ID 

(12) the amino acid sequence of SEQ ID 

(13) the amino acid sequence of SEQ ID 

(14) the amino acid sequence of SEQ ID 

(15) the amino acid sequence of SEQ ID 

(16) the amino acid sequence of SEQ ID 

(17) the amino acid sequence of SEQ ID 

(18) the amino acid sequence of SEQ ID 
(ii) an organizer comprising at least one first attachment 

site, 

wherein said organizer is connected to said core particle by at least one 
covalent bond; and 



NO: 105; 
NO: 106; 
NO: 119; 
NO: 120; 
NO:123; 
NO: 125; 
NO:131; 
NO: 132; 
NO: 134; 
NO: 157; and 
NO: 158; and 
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(b) an antigen or antigenic determinant with at least one second 
attachment site, said second attachment site being selected from the group 
consisting of: 

(i) an attachment site not naturally occurring with said 
antigen or antigenic determinant; and 

(ii) an attachment site naturally occurring with said 
antigen or antigenic determinant, 

wherein said second attachment site is capable of association through at 
least one non-peptide bond to said first attachment site; and 

wherein said antigen or antigenic determinant and said scaffold interact 
through said association to form an ordered and repetitive antigen array. 

35. The composition of claim 34, wherein said organizer is a 
polypeptide or residue thereof, 

wherein said second attachment site is a polypeptide or residue thereof, 

and 

wherein said first attachment site is a lysine residue and said second 
attachment site is a cysteine residue. 

36. The composition of claim 34, wherein one or more cysteine 
residues of said Hepatitis B virus capsid protein have been either deleted or 
substituted with another amino acid residue. 

37. The composition of claim 34, wherein said first and/or said second 
attachment sites comprise: 

(a) an antigen and an antibody or antibody fragment thereto; 

(b) biotin and avidin; 

(c) strepavidin and biotin; 

(d) a receptor and its ligand; 

(e) a ligand-binding protein and its ligand; 
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(f) 
(g) 
(h) 
(i) 



interacting leucine zipper polypeptides; 
an amino group and a chemical group reactive thereto; 
a carboxyl group and a chemical group reactive thereto; 
a sulfhydryl group and a chemical group reactive thereto; 



or 



0) 



a combination thereof. 



38. The composition of claim 36, wherein the cysteine residues 
corresponding to amino acids 48 and 107 in SEQ ID NO: 134 have been either 
deleted or substituted with another amino acid residue. 

39. The composition of claim 37, wherein said Hepatitis B virus capsid 
protein and said antigen or antigenic determinant are attached by an attachment 
comprising interacting leucine zipper polypeptides. 

40. The composition of claim 39, wherein said interacting leucine 
zipper polypeptides are FOS and/or JUN polypeptides. 

41. The composition of any one of claims 28, 34, 35, 36 and 38, 
wherein said antigen is selected from the group consisting of; 



(a) 



an antigen suited to induce an immune response against 



bacteria, 



(b) 



an antigen suited to induce an immune response against 



viruses, 



(c) 



an antigen suited to induce an immune response against 



parasites, 



(d) 



an antigen suited to induce an immune response against 



cancer cells, 



(e) 



an antigen suited to induce an immune response against 



allergens, 
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(f) an antigen suited to induce an immune response in a farm 

animals, and 

(g) a protein suited to induce an immune response in a pet. 

42. The composition of claim 41, wherein the antigen is a protein, 
polypeptide, or a fragment thereof 

43. The composition of claim 47, wherein said antigen induces an 
immune response against one or more allergens. 



44. The composition of claim 47, wherein said antigen is: 



(a) 


a recombinant 


KP) 


a recomuinani 


(c) 


a recombinant 


(d) 


a recombinant 


(e) 


a recombinant 


(f) 


a recombinant 


(g) 


a recombinant 


(h) 


a recombinant 


(i) 


a recombinant 


0) 


a recombinant 


(k) 


a recombinant 


0) 


a recombinant 


(m) 


a recombinant 


(n) 


a recombinant 


(o) 


a recombinant 


(P) 


a recombinant 


(q) 


a recombinant 


(r) 


a recombinant 


(s) 


a recombinant 
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(t) a recombinant protein of Chlamydia. 

45. The composition of any one of claims 1, 1 1 and 21, wherein said 
antigen is selected from the group consisting of: 

(a) an antigen suited to induce an immune response against 

bacteria, 

(b) an antigen suited to induce an immune response against 

viruses, 

(c) an antigen suited to induce an immune response against 

parasites, 

(d) an antigen suited to induce an immune response against 

cancer cells, 

(e) an antigen suited to induce an immune response in a farm 

animals, and 

(f) an antigen suited to induce an immune response in a pet, 

and 

(g) any other antigen involved in a pathophysiological context. 

46. The composition of claim 45, wherein the antigen is a protein, a 
polypeptide, or a fragment thereof 

47. The composition of any one of claims 1, 11 or 21, wherein said 
antigen is: 

(a) a recombinant protein of HIV, 

(b) a recombinant protein of Influenza virus, 

(c) a recombinant protein of Hepatitis C virus, 

(d) a recombinant protein of Toxoplasma, 

(e) a recombinant protein of Plasmodium falciparum, 

(f) a recombinant protein of Plasmodium vivax, 

(g) a recombinant protein of Plasmodium ovale, 
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(h) a recombinant protein of Plasmodium malariae, 

(i) a recombinant protein of breast cancer cells, 
(j) a recombinant protein of kidney cancer cells, 
(k) a recombinant protein of prostate cancer cells, 
(1) a recombinant protein of skin cancer cells, 
(m) a recombinant protein of brain cancer cells, 
(n) a recombinant protein of leukemia cells, 

(o) a recombinant profiling, 

(p) a recombinant protein of Chlamydia. 

48. A pharmaceutical composition comprising the composition of any 
one of claims 1, 11, 21, 28, 34, 35, 36, 38, 41 or 44, and a pharmaceutical^ 
acceptable carrier. 

49. A vaccine composition comprising the composition of any one of 
claims 1, 11, 21, 28, 34, 35, 36, 38, 41 or 44. 

50. The vaccine composition of claim 49, further comprising at least 
one adjuvant. 

51. A method of immunizing, comprising administering to a subj ect the 
vaccine composition of claim 49 or 50. 

52. The method of claim 51, wherein said administering produces an 
immune response. 

53. The method of claim 51, wherein said administering produces a 
humoral immune response. 
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54. The method of claim 51, wherein said administering produces a 
cellular immune response. 

55. The method of claim 51, wherein said administering produces a 
humoral immune response and a cellular immune response. 

56. The method of claim 51, wherein said administering produces a 
protective immune response. 

57. A method of making the composition of claim 1, comprising 
combining said pilus and said antigen or antigenic determinant, wherein said pilus 
and said antigen or antigenic determinant interact to form an antigen array. 

58. The method of claim 57, wherein said antigen array is ordered 
and/or repetitive. 

59. A method of making the composition of claim 11, comprising 
combining said pilin polypeptide and said antigen or antigenic determinant, 
wherein said pilin polypeptide and said antigen or antigenic determinant interact 
to form an antigen array. 

60. The method of claim 61, wherein said antigen array is ordered 
and/or repetitive. 

61. A method of making the composition of claim 21, 28, 34, 35, 36 
or 38, comprising combining said non-natural molecular scaffold and said antigen 
or antigenic determinant, wherein said non-natural molecular scaffold and said 
antigen or antigenic determinant interact to form an antigen array. 
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62. The method of claim 61, wherein said antigen array is ordered 
and/or repetitive. 

63. A composition comprising; 

(a) a non-natural molecular scaffold comprising: 

(i) a core particle selected from the group consisting 

of: 

(1) a bacterial pilus; and 

(2) a recombinant form of a bacterial pilus or 

pilin protein; and 

(ii) an organizer comprising at least one first attachment 

site, 

wherein said organizer is connected to said core particle by at least one 
covalent bond; and 

(b) an antigen or antigenic determinant with at least one second 
attachment site, said second attachment site being selected from the group 
consisting of: 

(i) an attachment site not naturally occurring with said 
antigen or antigenic determinant; and 

(ii) an attachment site naturally occurring with said 
antigen or antigenic determinant, 

wherein said second attachment site is capable of association through at 
least one non-peptide bond to said first attachment site; 

wherein said antigen or antigenic determinant and said scaffold interact 
through said association to form an ordered and repetitive antigen array, and 

wherein said antigen or antigenic determinant is selected from the group 
consisting of an influenza M2 peptide, the GRA2 polypeptide, the DP 178c 
peptide, the tumor necrosis factor polypeptide, a tumor necrosis factor peptide, 
the B2 peptide, the D2 peptide, and the Ap peptide. 
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64. The composition of claim 63, wherein said antigen or antigenic 
determinant is the influenza M2 peptide or variants thereof. 

65. The composition of claim 63 5 wherein said antigen or antigenic 
determinant is the GRA2 polypeptide. 

66. The composition of claim 63, wherein said antigen or antigenic 
determinant is the DP 178c peptide. 

67. The composition of claim 63, wherein said antigen or antigenic 
determinant is the tumor necrosis factor polypeptide. 

68. The composition of claim 63, wherein said antigen or antigenic 
determinant is a tumor necrosis factor peptide. 

69. The composition of claim 63, wherein said antigen or antigenic 
determinant is the B2 peptide. 

70. The composition of claim 63, wherein said antigen or antigenic 
determinant is the D2 peptide. 

71. The composition of claim 63, wherein said antigen or antigenic 
determinant is the Ap peptide. 

72. The composition of claim 63, wherein said organizer is a 
polypeptide or residue thereof; and wherein said second attachment site is a 
polypeptide or residue thereof. 

73 . The composition of claim 63 , wherein said first and/ or said second 
attachment sites comprise: 
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(a) an antigen and an antibody or antibody fragment thereto; 

(b) biotin and avidin; 

(c) strepavidin and biotin; 

(d) a receptor and its ligand; 

(e) a ligand-binding protein and its ligand; 

(f) interacting leucine zipper polypeptides; 

(g) an amino group and a chemical group reactive thereto; 

(h) a carboxyl group and a chemical group reactive thereto; 

(i) a sulfhydryl group and a chemical group reactive thereto; 

or 

(j) a combination thereof. 



74. The composition of claim 63, wherein said first and/or said second 
attachment sites comprise interacting leucine zipper polypeptides. 

7 5 . The composition of claim 63 , wherein said bacterial pilus is a Type- 
1 pilus of Eschericia coli. 

I 

76. The composition of claim 63, wherein pilus subunits of said type- 1 
pilus comprise the amino acid sequence of SEQ ID No. 146 or a sequence having 
at least 65, 70, 75, 80, 85, 90 or 95% sequence identity to SEQ ID NO: 146. 

77. The composition of claim 63, wherein said interacting leucine 
zipper polypeptides are the JUN and/or FOS leucine zipper polypeptides. 

78 . A vaccine composition comprising the composition of claim 63 or 
claim 43. 



79. Amethod of immunizing, comprising administering to a subject the 
vaccine composition of claim 49 or 50. 
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80. The method of claim 79, wherein said administering produces an 
immune response. 

81. A method of making the composition of claim 63 , comprising 
combining said non-natural molecular scaffold and said antigen or antigenic 
determinant, wherein said non-natural molecular scaffold and said antigen or 
antigenic determinant interact to form an antigen array. 

82. The method of claim 81, wherein said antigen array is ordered 
and/or repetitive. 

83. A method of immunizing, comprising administering the 
composition of any one of claims 1, 1 1, 21, 49 or 50 to a subject, wherein for 
inducing a Th2 response, wherein said administering produces a Th2 response that 
is specific for said antigen or antigenic determinant. 

84. The method of claim 83, wherein antibodies specific for said 
antigen or antigenic determinant of a subtype corresponding to the Th2 subtype 
are induced in the subject. 

85. The method of claim 83, wherein the subject does not generate a 
Thl response that is specific for said pilus, said pilin polypeptide, or said antigen 
or antigenic determinant. 
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SEQUENCE LISTING 



<110> Cytos Biotechnology GmbH 
Sebbel, Peter 
Dunant, Nicolas 
Bachmann, Martin 
Tissot, Alain 
Lechner, Franziska 

<12 0> Molecular Antigen Array 



<130> 1700.018PC02 

<140> 
<141> 

<160> 186 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 1 

ggggacgcgt gcagcaggta accaccgtta aagaaggcac c 



<210> 2 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 2 

cggtggttac ctgctgcacg cgttgcttaa gcgacatgta gcgg 



<210> 3 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 3 

ccatgaggcc tacgataccc 



<210> 4 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 



<400> 4 

ggcactcacg gcgcgcttta caggc 



25 



<210> 5 
<211> 47 
<212> DNA 

<213> Artificial Sequence * 
<220> 

<223> Primer 
<400> 5 

ccttctttaa cggtggttac ctgctggcaa ccaacgtggt tcatgac 47 



<210> 6 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 7 
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 7 

gggtctagat tcccaaccat tcccttatcc aggctttttg acaacgctat gctccgcgcc 60 
catcgtctgc accagctggc ctttgacacc 90 



<210> 8 
<211> 108 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 8 

gggtctagaa ggaggtaaaa aacgatgaaa aagacagcta tcgcgattgc agtggcactg 60 
gctggtttcg ctaccgtagc gcaggccttc ccaaccattc ccttatcc 108 



<210> 9 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<400> 6 

aagcatgctg cacgcgtgtg cggtggtcgg atcgcccggc 



40 



<220> 
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<223> Primer 



<400> 9 

cccgaattcc tagaagccac agctgccctc c 



31 



<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 10 

cctgcggtgg tctgaccgac accc 24 



<210> 11 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 12 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 12 

ctatcatcta gaatgaatag aggattcttt aac 33 



<210> 13 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified ribosome 
binding site 



<400> 11 

ccgcggaaga gccaccgcaa ccaccgtgtg ccgccaggat g 



41 



<400> 13 

aggaggtaaa aaacg 



15 



<210> 14 
<211> 21 
<212> PRT 



<213> Artificial Sequence 



<220> 

<223> signal peptide 
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<400> 14 

Met Lys Lys Thr Ala lie Ala He Ala Val Ala Leu Ala Gly Phe Ala 
1 ~ 5 10 15 

Thr Val Ala Gin Ala 
20 



<210> 15 
<211> 46 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> modified Fos 
construct 

<400> 15 

Cys Gly Gly Leu Thr Asp Thr Leu Gin Ala Glu Thr Asp Gin Val Glu 
1 5 10 15 

Asp Glu Lys Ser Ala Leu Gin Thr Glu He Ala Asn Leu Leu Lys Glu 
20 25 30 

Lys Glu Lys Leu Glu Phe He Leu Ala Ala His Gly Gly Cys 
35 40 45 



<210> 16 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide linker 
<400> 16 

Ala Ala Ala Ser Gly Gly 
1 5 



<210> 17 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide linker 
<400> 17 

Gly Gly Ser Ala Ala Ala 
1 5 



<210> 18 
<2H> 256 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Fos fusion construct 



<400> 18 
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gaattcagga ggtaaaaaac gatgaaaaag acagctatcg cgattgcagt ggcactggct 60 

ggtttcgcta ccgtagcgca ggcctgggtg ggggcggccg cttctggtgg ttgcggtggt 120 

ctgaccgaca ccctgcaggc ggaaaccgac caggtggaag acgaaaaatc cgcgctgcaa 180 

accgaaatcg cgaacctgct gaaagaaaaa gaaaagctgg agttcatcct ggcggcacac 24 0 

ggtggttgct aagctt 256 



<210> 19 
<211> 52 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fos fusion construct 
<400> 19 

Ala Ala Ala Ser Gly Gly Cys Gly Gly Leu Thr Asp Thr Leu Gin Ala 

5 10 ,15 

Glu Thr Asp Gin Val Glu Asp Glu Lys Ser Ala Leu Gin Thr Glu lie 
20 25 30 

Ala Asn Leu Leu Lys Glu Lys Glu Lys Leu Glu Phe lie Leu Ala Ala 
35 40 45 

His Gly Gly Cys 
50 



<210> 20 
<211> 261 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<220> 

<221> CDS 

<222> (22) . . (240) 

<400> 20 

gaattcagga ggtaaaaaac g atg aaa aag aca get ate gcg att gca gtg 51 

Met Lys Lys Thr Ala lie Ala lie Ala Val 
1 5 10 

gca ctg get ggt ttc get acc gta gcg cag gec tgc ggt ggt ctg acc 99 
Ala Leu Ala Gly Phe Ala Thr Val Ala Gin Ala Cys Gly Gly Leu Thr 
15 20 25 

gac acc ctg cag gcg gaa acc gac cag gtg gaa gac gaa aaa tec gcg 147 
Asp Thr Leu Gin Ala Glu Thr Asp Gin Val Glu Asp Glu Lys Ser Ala 
30 35 40 

ctg caa acc gaa ate gcg aac ctg ctg aaa gaa aaa gaa aag ctg gag 195 
Leu Gin Thr Glu lie Ala Asn Leu Leu Lys Glu Lys Glu Lys Leu Glu 
45 50 55 

ttc ate ctg gcg gca cac ggt ggt tgc ggt ggt tct gcg gee get 240 
Phe lie Leu Ala Ala His Gly Gly Cys Gly Gly Ser Ala Ala Ala 
60 65 70 
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gggtgtgggg atatcaagct t 2 61 

<210> 21 
<211> 73 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<400> 21 

Met Lys Lys Thr Ala lie Ala lie Ala Val Ala Leu Ala Gly Phe Ala 
15 10 15 

Thr Val Ala Gin Ala Cys Gly Gly Leu Thr Asp Thr Leu Gin Ala Glu 
20 25 30 

Thr Asp Gin Val Glu Asp' Glu Lys Ser Ala Leu Gin Thr Glu He Ala 
35 40 45 

Asn Leu Leu Lys Glu Lys Glu Lys Leu Glu Phe He Leu Ala Ala His 
50 " 55 60 

Gly Gly Cys Gly Gly Ser Ala Ala Ala 
65 70 



<210> 22 
<211> 196 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<220> 

<221> CDS 

<222> (34) . . (189) 

<400> 22 

gaattcagga ggtaaaaaga tatcgggtgt ggg gcg gcc get tct ggt ggt tgc 54 

Ala Ala Ala Ser Gly Gly Cys 
1 5 

ggt ggt ctg acc gac acc ctg cag gcg gaa acc gac cag gtg gaa gac 102 
Gly Gly Leu Thr Asp Thr Leu Gin Ala Glu Thr Asp Gin Val Glu Asp 
10 15 20 

gaa aaa tec gcg ctg caa acc gaa ate gcg aac ctg ctg aaa gaa aaa 150 
Glu Lys Ser Ala Leu Gin Thr Glu -He Ala Asn Leu Leu Lys Glu Lys 
25 30 35 

gaa aag ctg gag ttc ate ctg gcg gca cac ggt ggt tgc taagctt 196 
Glu Lys Leu Glu Phe He Leu Ala Ala His Gly Gly Cys 
40 45 50 



<210> 23 
<211> 52 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<400> 23 

Ala Ala Ala Ser Gly Gly Cys Gly Gly Leu Thr Asp Thr Leu Gin Ala 
15 10 15 

Glu Thr Asp Gin Val Glu Asp Glu Lys Ser Ala Leu Gin Thr Glu lie 
20 25 30 

Ala Asn Leu Leu Lys Glu Lys Glu Lys Leu Glu Phe lie Leu Ala Ala 
35 40 45 

His Gly Gly Cys 
50 



<210> 24 
<211> 204 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<400> 24 

gaattcagga ggtaaaaaac gatggcttgc 
accgaccagg tggaagacga aaaatccgcg 
gaaaaagaaa agctggagtt catcctggcg 
gctgggtgtg gggatatcaa gctt 



ggtggtctga ccgacaccct gcaggcggaa 60 
ctgcaaaccg aaatcgcgaa cctgctgaaa 120 
gcacacggtg gttgcggtgg ttctgcggcc 180 

204 



<210> 25 
<211> 56 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<400> 25 

Lys Thr Met Ala Cys Gly Gly Leu Thr Asp Thr Leu Gin Ala Glu Thr 

1 5 10 15 

Asp Gin Val Glu Asp Glu Lys Ser Ala Leu Gin Thr Glu lie Ala Asn 

20 25 30 

Leu Leu Lys Glu Lys Glu Lys Leu Glu Phe lie Leu Ala Ala His Gly 

35 40 45 

Gly Cys Gly Gly Ser Ala Ala Ala 
50 55 



<210> 26 

<211> 26 

<212> PRT 

<213> Homo sapiens 



<400> 26 
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Met Ala Thr Gly Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu 
15 10 15 



Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala 
20 25 



<210> 27 
<211> 262 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<400> 27 

gaattcaggc ctatggctac aggctcccgg acgtccctgc tcctggcttt tggcctgctc 60 
tgcctgccct ggcttcaaga gggcagcgct gggtgtgggg cggccgcttc tggtggttgc 12 0 
ggtggtctga ccgacaccct gcaggcggaa accgaccagg tggaagacga aaaatccgcg 18 0 
ctgcaaaccg aaatcgcgaa cctgctgaaa gaaaaagaaa agctggagtt catcctggcg 240 
gcacacggtg gttgctaagc tt 262 



<210> 28 
<211> 52 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<400> 28 

Ala Ala Ala Ser Gly Gly Cys Gly Gly Leu Thr Asp Thr Leu Gin Ala 
5 10 15 

Glu Thr Asp Gin Val Glu Asp Glu Lys Ser Ala Leu Gin Thr Glu lie 
20 25 30 

Ala Asn Leu Leu Lys Glu Lys Glu Lys Leu Glu Phe lie Leu Ala Ala 
35 40 45 

His Gly Gly Cys 
50 



<210> 29 
<211> 261 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<220> 

<221> CDS 

<222> (7) . . (240) 



<400> 29 
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gaattc atg get aca ggc tec egg acg tec ctg etc ctg get ttt ggc 48 
Met Ala Thr Gly Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly 
15 10 

ctg etc tgc ctg ccc tgg ctt caa gag ggc age get tgc ggt ggt ctg 96 
Leu Leu Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala Cys Gly Gly Leu 
15 ^ 20 25 30 

acc gac ace ctg cag gcg gaa acc gac cag gtg gaa gac gaa aaa tec 144 
Thr Asp Thr Leu Gin Ala Glu Thr Asp Gin Val Glu Asp Glu Lys Ser 
35 40 45 

gcg ctg caa acc gaa ate gcg aac ctg ctg aaa gaa aaa gaa aag ctg 192 
Ala Leu Gin Thr Glu lie Ala Asn Leu Leu Lys Glu Lys Glu Lys Leu 
50 ' 55 60 

gag ttc ate ctg gcg gca cac ggt ggt tgc ggt ggt tct gcg gee get 240 
Glu Phe lie Leu Ala Ala His Gly Gly Cys Gly Gly Ser Ala Ala Ala 
65 70 75 

gggtgtggga ggectaaget t 2 61 

<210> 30 
<211> 78 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fos fusion 
construct 

<400> 30 

Met Ala Thr Gly Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu 
1 5 10 15 

Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala Cys Gly Gly Leu Thr Asp 
20 25 30 

Thr Leu Gin Ala Glu Thr Asp Gin Val Glu Asp Glu Lys Ser Ala Leu 
35 40 45 

Gin Thr Glu lie Ala Asn Leu Leu Lys Glu Lys Glu Lys Leu Glu Phe 
50 55 60 

lie Leu Ala Ala His Gly Gly Cys Gly Gly Ser Ala Ala Ala 
65 70 75 



<210> 31 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 31 

cctgggtggg ggcggccgct tctggtggtt gcggtggtct gacc 4 4 



<210> 32 
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<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 32 

ggtgggaatt caggaggtaa aaagatatcg ggtgtggggc ggcc 4 4 



<210> 33 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 34 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 34 

gcttgcggtg gtctgacc 18 



<210> 35 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 36 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 36 

ccaccaagct tgatatcccc acacccagcg gccgcagaac caccgcaacc accg 54 



<210> 37 
<211> 32 
<212> DNA 

<213> Artificial/ Sequence 



<400> 33 

ggtgggaatt caggaggtaa aaaacgatgg cttgcggtgg tctgacc 



47 



<400> 35 

ccaccaagct tagcaaccac cgtgtgc 



27 
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<220> 

<223> Primer 



<400> 37 



ccaccaagct taggcctccc acacccagcg gc 



32 



<210> 38 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 38 

ggtgggaatt caggaggtaa aaaacgatg 29 



<210> 39 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 40 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 40 

ggtgggaatt catggctaca ggctccc 27 



<210> 41 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 41 

gggtctagaa tggctacagg ctcccggacg tccctgctcc tggcttttgg cctgctctg 59 



<210> 42 
<211> 58 
<212> DNA 

<213> Artificial Sequence 



<400> 39 

ggtgggaatt caggcctatg gctacaggct cc 



32 



<220> 

<223> Primer 
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<400> 42 

cgcaggcctc ggcactgccc tcttgaagcc agggeaggca gagcaggcca aaagccag 58 

<210> 43 
<211> 402 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified bee 

venom phospholipase A2 

<220> 

<221> CDS 

<222> (1) . . (402) 

<400> 43 

ate ate tac cca ggt act ctg tgg tgt ggt cac ggc aac aaa tct tct 48 

lie lie Tyr Pro Gly Thr Leu Trp Cys Gly His Gly Asn Lys Ser Ser 

1 ' 5 ~ 10 15 

ggt ccg aac gaa etc ggc cgc ttt aaa cac acc gac gca tgc tgt cgc 96 
Gly Pro Asn Glu Leu Gly Arg Phe Lys His Thr Asp Ala Cys Cys Arg 
20 25 30 

acc cag gac atg tgt ccg gac gtc atg tct get ggt gaa tct aaa cac 14 4 
Thr Gin Asp Met Cys Pro Asp Val Met Ser Ala Gly Glu Ser Lys His 
35 40 45 

ggg tta act aac acc get tct cac acg cgt etc age tgc gac tgc gac 192 
Gly Leu Thr Asn Thr Ala Ser His Thr Arg Leu Ser Cys Asp Cys Asp 
50 55 60 

gac aaa ttc tac gac tgc ctt aag aac tec gec gat acc ate tct tct 240 
Asp Lys Phe Tyr Asp Cys Leu Lys Asn Ser Ala Asp Thr lie Ser Ser 
65 " 7 0 ~ 7 5 8 0 

tac ttc gtt ggt aaa atg tat ttc aac ctg ate gat acc aaa tgt tac 288 
Tyr Phe Val Gly Lys Met Tyr Phe Asn Leu lie Asp Thr Lys Cys Tyr 
8 5 90 95 

aaa ctg gaa cac ccg gta acc ggc tgc ggc gaa cgt acc gaa ggt cgc 336 
Lys Leu Glu His Pro Val Thr Gly Cys Gly Glu Arg Thr Glu Gly Arg 
100 105 110 

tgc ctg cac tac acc gtt gac aaa tct aaa ccg aaa gtt tac cag tgg 384 
Cys Leu His Tyr Thr Val Asp Lys Ser Lys Pro Lys Val Tyr Gin Trp 
115 ~ 120 125 

ttc gac ctg cgc aaa tac 402 
Phe Asp Leu Arg Lys Tyr 
130 

<210> 44 
<211> 134 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified bee 

venom phospholipase A2 
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<400> 44 

lie lie Tyr Pro Gly Thr Leu Trp Cys Gly His Gly Asn Lys Ser Ser 
1 ^ 5 ' 10 15 

Gly Pro Asn Glu Leu Gly Arg Phe Lys His Thr Asp Ala Cys Cys Arg 
20 25 30 

Thr Gin Asp Met Cys Pro Asp Val Met Ser Ala Gly Glu Ser Lys His 
35 "40 45 

Gly Leu Thr Asn Thr Ala Ser His Thr Arg Leu Ser Cys Asp Cys Asp 
50 55 60 

Asp Lys Phe Tyr Asp Cys Leu Lys Asn Ser Ala Asp Thr lie Ser Ser 
65 ~ 70 75 80 

Tyr Phe Val Gly Lys Met Tyr Phe Asn Leu He Asp Thr Lys Cys Tyr 
85 90 95 

Lys Leu Glu His Pro Val Thr Gly Cys Gly Glu Arg Thr Glu Gly Arg 
100 105 110 

Cys Leu His Tyr Thr Val Asp Lys Ser Lys Pro Lys Val Tyr Gin Trp 
115 ~ 120 125 

Phe Asp Leu Arg Lys Tyr 
130 



<210> 45 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 45 

ccatcatcta cccaggtac 19 

<210> 46 
<2H> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 46 

cccacaccca gcggccgcgt atttgcgcag gtcg 34 

<210> 47 
<2H> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 47 
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cggtggttct gcggccgcta tcatctaccc aggtac 



36 



<210> 48 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 48 

ttagtatttg cgcaggtcg 19 



<210> 49 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 50 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 50 

accaccagaa gcggccgcag gggaaacaca tctgcc 3 6 



<210> 51 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 52 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 49 

ccggctccat cggtgcag 



18 



<400> 51 

cggtggttct gcggccgctg gctccatcgg tgcag 



35 



<400> 52 

ttaaggggaa acacatctgc c 



21 
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<210> 53 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 53 

actagtctag aatgagagtg aaggagaaat ate 



<210> 54 
<211> 42 
<212> DNA ■ 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 54 

tagcatgeta gcaccgaatt tatctaattc caataattct tg 



<210> 55 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 55 

gtagcaccca ccaaggcaaa gctgaaagct acccagctcg agaaactggc 



<210> 56 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 56 

caaagctcct attcccactg ccagtttctc gagctgggta gctttcag 



<210> 57 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 57 

ttcggtgcta gcggtggctg cggtggtctg accgac 



<210> 58 
<2ll> 37 
<2l2> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 58 

gatgctgggc ccttaaccgc aaccaccgtg tgccgcc 37 



<210> 59 
<211> 46 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> JUN amino acid 
sequence 

<400> 59 

Cys Gly Gly Arg lie Ala Arg Leu Glu Glu Lys Val Lys Thr Leu Lys 
1 5 10 15 

Ala Gin Asn Ser Glu Leu Ala Ser Thr Ala Asn Met Leu Arg Glu Gin 
20 25 30 

Val Ala Gin Leu Lys Gin Lys Val Met Asn His Val Gly Cys 
35 40 45 



<210> 60 
<211> 46 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> FOS amino 

acid sequence 

<400> 60 

Cys Gly Gly Leu Thr Asp Thr Leu 
1 5 

Asp Glu Lys Ser Ala Leu Gin Thr 
20 

Lys Glu Lys Leu Glu Phe He Leu 

35 4 0 



Gin Ala Glu Thr Asp Gin Val Glu 

10 15 

Glu He Ala Asn Leu Leu Lys Glu 

25 30 

Ala Ala His Gly Gly Cys 
4 5 



<210> 61 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 61 

ccggaattca tgtgcggtgg tcggatcgcc egg 33 



<210> 62 
<211> 39 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 62 

gtcgctaccc gcggctccgc aaccaacgtg gttcatgac 39 



<210> 63 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 63 

gttggttgcg gagccgcggg tagcgacatt gacccttata aagaatttgg 50 



<210> 64 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 64 

cgcgtcccaa gcttctacgg aagcgttgat aggatagg 



<210> 65 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 65 

ctagccgcgg gttgcggtgg tcggatcgcc egg 



<210> 66 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 66 

cgcgtcccaa gcttttagca accaacgtgg ttcatgac 38 



<210> 67 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



38 



33 
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<220> 

<223> Primer 



<400> 67 

ccggaattca tggacattga cccttataaa g 



31 



<210> 68 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 68 

ccgaccaccg caacccgcgg ctagcggaag cgttgatagg atagg 4 5 



<210> 69 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 70 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 70 

gtcgctaccc gcggctccgc aaccaacgtg gttcatgac 39 



<210> 71 
<211> 31. 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 69 

ctaatggatc cggtgggggc tgcggtggtc ggatcgcccg gctcgag 



47 



<400> 71 

ccggaattca tggacattga cccttataaa g 



31 



<210> 72 
<211> 48 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Primer 
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<400> 72 

ccgaccaccg cagcccccac cggatccatt agtacccacc caggtagc 



<210> 73 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 73 

gttggttgcg gagccgcggg tagcgaccta gtagtcagtt atgtc 



<210> 74 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 74 

cgcgtcccaa gcttctacgg aagcgttgat aggatagg 



<210> 75 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 75 

ctagccgcgg gttgcggtgg tcggatcgcc egg 



<210> 76 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 76 

cgcgtcccaa gcttttagca accaacgtgg ttcatgac 



<210> 77 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 77 

ceggaattea tggccacact tttaaggagc 
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<210> 78 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Primer 
<400> 78 

cgcgtcccaa gcttttagca accaacgtgg ttcatgac 



<210> 79 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 79 

ccggaattca tggacattga cccttataaa g 



<210> 80 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 80 

cctagagcca cctttgccac catcttctaa attagtaccc acccaggtag 



<210> 81 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 81 

gaagatggtg gcaaaggtgg ctctagggac ctagtagtca gttatgtc 



<210> 82 
<211> 38 
<212> DNA ■ 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 82 

cgcgtcccaa gcttctaaac aacagtagtc tccggaag 



<210> 83 
<2H> 36 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 83 

gccgaattcc tagcagctag caccgaattt atctaa 



<210> 84 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 84 

ggttaagtcg acatgagagt gaaggagaaa tat 



<210> 85 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 85 

taaccgaatt caggaggtaa aaagatatgg 



<210> 86 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 86 

gaagtaaagc ttttaaccac cgcaaccacc agaag 



<210> 87 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 87 

tcgaatgggc cctcatcttc gtgtgctagt cag 



<210> 88 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Fos fusion 
construct 

<400> 88 
Glu Phe Arg Arg 
1 



<210> 89 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 89 

Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 " "40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro He 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 " 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Gly Ser Gin Cys 
180 



<210> 90 

<211> 183 

<212> PRT 

<213> Hepatitis B virus 

<400> 90 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
• 1 5 10 15 



Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 
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Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro Thr 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Thr Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Cys Val lie Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 "* 150 " 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Gly Ser Gin Cys 
180 



<210> 91 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 91 

Met Gin Leu Phe His Leu Cys Leu lie He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 " 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro He Ser Arg Asp 
100 105 HO 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 " 135 140 



WO 01/85208 



PCT/IB01/00741 



lie Glu Tyr Leu Val Ser 
145 150 

Tyr Arg Pro Pro Asn Ala 
165 

Val Val Arg Arg Arg Gly 
180 

Arg Arg Arg Arg Ser Gin 
195 

Glu Ser Gin Cys 
210 



-24- 

Phe Gly Val Trp lie Arg Thr 
155 

Pro lie Leu Ser Thr Leu Pro 
170 

Arg Ser Pro Arg Arg Arg Thr 
185 

Ser Pro Arg Arg Arg Arg Ser 
200 205 



<210> 92 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 92 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser 
15 10 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly 
20 25 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
50 " 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
65 70 75 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
85 90 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro lie 
100 105 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
145 150 155 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
.165 17 0 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
180- 185 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
195 200 205 

Glu Ser Gin Cys 
210 



Pro Pro Ala 
160 

Glu Thr Thr 
175 



Pro Ser Pro 
190 

Gin Ser Arg 



Cys Pro Thr 
15 

Met Asp He 
30 



Ser Phe Leu 



Asn Ala Ser 



Ser Pro His 
80 

Leu Met Thr 
95 

Ser Arg Asp 
110 

Phe Arg Gin 



Glu Thr Val 



Pro Pro Ala 
160 

Glu Thr Thr 
175 



Pro Ser Pro 
190 

Gin Ser Arg 
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<210> 93 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 93 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 ~ 5 10 15 

Ser Phe Leu Pro Thr Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 ~ " 135 140 

Glu Thr Cys Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 94 

<211> 212 

<212> PRT 

<213> Hepatitis B virus 

<400> 94 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
1 5 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
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65 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro Val Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Val Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 ' 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Vto He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 ' 205 

Glu Ser Gin Cys 
210 



<210> 95 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 95 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Asp Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 4 0 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ~ ~ 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Asp Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro Val Ser Arg Asp 
100 ~ 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Val Gly Leu Lys Phe Arg Gin 
115 " 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 
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Ile Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 " 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 o 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 96 

<211> 212 

<212> PRT 

<213> Hepatitis B virus 

<400> 96 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
1 5 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 4 0 4 5 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 ' 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro Gin 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro lie Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 ~ 120 " 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 ~ 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 
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<210> 97 
<211> 212 
<212> PRT 
<213> Hepatitis 

<400> 97 
Met Gin Leu Phe* 
1 

Val Gin Ala Ser 

20 

Asp Pro Tyr Lys 
35 

Pro Ser Asp Phe 
50 

Ala Leu Tyr Arg 
65 

His Thr Ala Leu 



Leu Ala Thr Trp 
100 



B virus 



His Leu Cys Leu 
5 



Lys Leu Cys Leu 



Glu Phe Gly Ala 
40 



Phe Pro Ser Val 
55 



Glu Ala Leu Glu 
70 



Arg Gin Ala lie 
85 



Val Gly Val Asn 



lie lie Ser Cys 
10 

Gly Trp Leu Trp 

2 5 

Thr Val Glu Leu 



Arg Asp Leu Leu 
60 

Ser Pro Glu His 
75 

Leu Cys Trp Gly 
90 

Leu Glu Asp Pro 
105 



Ser Cys Pro Thr 
15 

Gly Met Asp lie 

30 

Leu Ser Phe Leu 
45 

Asp Thr Ala Ser 



Cys Ser Pro His 
80 

Glu Leu Met Thr 
95 



Ala Ser Arg Asp 
110 



Leu Val Val Ser Tyr 
115 

Leu Leu Trp Phe His 
130 

lie Glu Tyr Leu Val 
145 

Tyr Lys Pro Pro Asn 
165 

Val Val Arg Arg Arg 
180 



Arg Arg Arg Arg Ser 
195 



Val Asn Thr Asn Met Gly 
120 



lie Ser Cys Leu Thr Phe 
135 



Ser Phe Gly Val Trp He 
150 155 



Ala Pro He Leu Ser Thr 
170 



Gly Arg Ser Pro Arq Arg 
18 5 



Gin Ser Pro Arg Arg Arg 
200 



Leu Lys Phe Arg Gin 
125 

Gly Arg Glu Thr Val 
140 

Arg Thr Pro Pro Ala 
160 

Leu Pro Glu Thr Thr 
175 

Arg Thr Pro Ser Pro 
190 

Arg Ser Gin Ser Arg 
205 



Gly Ser Gin Cys 
210 



<210> 98 
<2H> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 98 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Phe Arg Asp Ala Leu Glu Ser Pro Glu His Cys 
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35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Asp Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Ser Asn Ala Pro He Leu Ser Thr Leu Pro 
130 " ~ 135 140 

Glu Thr Cys Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 99 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 99 

Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 " 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 ~ " 135 140 
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Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 ' 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 100 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 100 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser "Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 4 0 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 ~ 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ~ " 70 75 80 

His Thr Ala Leu Arg His Ala lie Leu Cys Trp Gly Asp Leu Arg Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro lie Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 " 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 101 
<2H> 212 
<212> PRT 

<213> Hepatitis B virus 
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<400> 101 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Asp Met Asp He 
20 ~ 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Phe Arg Asp Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Ala Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Gin Ala 
145 *" 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Cys 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 "* 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
19.5 " 200 205 

Glu Ser Gin Cys 
210 



<210> 102 
<211> 183 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

human Hepatitus B construct 

<400> 102 

Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 ■ 
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Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 J " 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 103 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 103 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp Leu Met Ser 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro He Ser Arg Asp 
100 " 105 " 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 
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Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 ~ 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 " 200 205 

Glu Ser Gin Cys 
210 



<210> 104 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 104 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arq Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 " 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 " 170 175 

Gin Ser Arg Glu Ser Gin Cys 
18 0 



<210> 105 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 



<400> 105 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
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15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Asp 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 ~ 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 ~ 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 106 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 106 

'Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Ala Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 ~ 90 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 HO 



WO 01/85208 



PCT/IB01/00741 



-35- 

Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 ~ ~ 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Thr Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 107 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 107 

Met Gin Leu Phe His Leu Cys Leu lie He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 " 55 60 

Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 . 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 
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<210> 108 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 108 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ~ 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Asp Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 " 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 ~ 150 " ~ 155 " 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 109 
<2H> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 109 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Thr Cys Pro Thr 
1 5 - 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 " ~ '25 30 



Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
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35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 " 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 ~ 120 ~ 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ala Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 ~ 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
" 165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 110 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 110 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 ~ 55 60 

Ala Leu Tyr Arg Glu Ala Phe Glu Cys Ser Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro He Ser Arg Asp 
100 ~ 105 HO 
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Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 ' 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 ' 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 111 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<220> 

<221> UNSURE 

<222> (28) . . (28) 

<22 3> May be any amino acid. 



<400> 111 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 . 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Xaa Asp Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 4 5 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ~ 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Asp Leu He Thr 
85 90 95 

Leu Ser Thr Trp Val Gly Gly Asn Leu Glu Asp Pro Thr Ser Arg Asp 
100 105 HO 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 
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Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Thr Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 112 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 112 

Met Gin Leu Phe His Leu Cys Leu lie He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 '* 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Asn Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 ~ 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 " ~ 185 ' ~ 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 113 
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<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 113 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Gys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ~ 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Cys Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 ' 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 ' 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 . ' 205 

Glu Ser Gin Cys 
210 



<210> 114 
<211> 212 
<212> PRT 
<213> Hepatitis 

<400> 114 
Met Gin Leu Phe 
1 

Val Gin Ala Ser 
20 



B virus 

His Leu Cys Leu 
5 

Lys Leu Cys Leu 



He He Ser Cys 
10 

Gly Trp Leu Trp 
25 



Ser Cys Pro Thr 
15 

Gly Met Asp He 
30 



Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 4 0 4 5 
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Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 " ~ 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 17 0 17 5 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Pro Gin Cys 
210 



<210> 115 
<211> 212 

<212> PRT " ' 

<213> Hepatitis B virus 

<400> 115 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Ser Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 * " 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 
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Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
14 5 " 150 ~ 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 116 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 116 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 J 40 4 5 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 " 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 ~ 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
14 5 ' 150 " 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Leu Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 
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Glu Ser Gin Cys 
210 



<210> 117 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 117 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
1 5 " 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 " 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 7 0 7 5 8 0 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 ~ 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Lys Gin 
115 ' 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 17 0 17 5 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arq Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 118 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 118 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
1 5 10 15 



Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
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20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ala 
50 " 55 60 

Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Thr Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 " 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 " 135 140 

Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 119 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 119 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Ser Met Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Tyr Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Thr Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Gly Asn Leu Gin Asp Pro Thr 
65 7 0 J 75 80 



Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 
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Phe Arg Gin Leu Leu Trp Phe His Val Ser Cys Leu Thr Phe Gly Arg 
100 " 105 110 

Glu Thr Val Val Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Gin Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 ~ ~ 135 140 

Glu Thr Cys Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 " " 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 120 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 120 

Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 " 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 " 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg His Val Phe Leu Cys Trp Gly Asp 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro Thr 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
8 5 90 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 ~ 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 . 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 121 
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<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 121 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp Leu Thr Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 122 
<211> 212 
<212> PRT 
<213> Hepatitis 

<400> 122 
Met Gin Leu Phe 
1 

Val Gin Ala Ser 
20 

Asp Pro Tyr Lys 
35 



B virus 

His Leu Cys Leu 
5 

Lys Leu Cys Leu 

Glu Phe Gly Ala 
40 



lie lie Ser Cys 
10 

Gly Trp Leu Trp 
25 

Thr Val Glu Leu 



Ser Cys Pro Thr 
15 

Gly Met Asp lie 
30 

Leu Ser Phe Leu 
45 
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Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ' ~ 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu lie Phe Gly Arg Glu Thr Val 
130 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 • ' 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 123 
<211> 183 
<212> PRT 
<213> Hepatitis 

<400> 123 
Met Asp He Asp 
1 

Ser Phe Leu Pro 

20 

Thr Ala Ser Ala 
35 

Ser Pro His His 
50 

Leu Met Thr Leu 
65 

Ser Arg Asp Leu 



Phe Arg Gin Leu 
100 

Glu Thr Val He 
115 



B virus 



Pro Tyr Lys Glu 
5 



Ser Asp Phe Phe 



Leu Tyr Arg Glu 
40 



Thr Ala Leu Arg 
55 



Ala Thr Trp Val 
7 0 



Val Val Ser Tyr 
85 



Leu Trp Phe His 



Glu Tyr Leu Val 
120 



Phe Gly Ala Thr 
10 

Pro Ser Val Arg 
25 

Ala Leu Glu Ser 



Gin Ala He Leu 
60 



Gly Val Asn Leu 
75 

Val Asn Thr Asn 
90 

He Ser Cys Leu 
105 

Ser Phe Gly Val 



Val Glu Leu Leu 
15 

Asp Leu Leu Asp 
30 

Pro Glu His Cys 
45 

Cys Trp Gly Asp 



Glu Asp Pro Val 
80 

Val Gly Leu Lys 
95 

Thr Phe Gly Arg 
110 

Trp lie Arg Thr 
125 
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Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 ~ " 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 ~ 155 160 

Pro Ser Pro Ala Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Glu Ser Gin Cys 
- 180 



<210> 124 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 124 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 " 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 ~ 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 ^ 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ~ 70 75 • 80 

His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp Leu Met Asn 
85 90 95 

Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro Val Ser Arg Asp 
100 105 110 

Leu Val Val Gly Tyr Val Asn Thr Thr Val Gly Leu Lys Phe Arg Gin 
115 " ^ 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 17 0 17 5 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 125 
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<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 125 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 ' ' 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
1 85 90 95 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Thr Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 ~ 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 126 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 126 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 4 0 4 5 

Pro Ser Asp Phe Phe Pro Ser Val Arg Ala Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 
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His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

lie Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 . 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 127 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 127 

Met Gin Leu Phe His Leu Cys Leu He, He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Asp Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Thr Arg Asp 
100 105 , 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Val Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 ~ 155 160 
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Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 ~ 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 ' 200 205 

Glu Ser Gin Cys 
210 



<210> 128 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 128 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 4 5 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Arg He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 ~ 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Thr Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 129 
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<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 129 

Met Gin Leu Phe His Leu Cys Leu Val He Ser Cys Ser Cys Pro Thr 
1.5 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 ~ 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ala 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys He Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

Leu Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 17 0 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 130 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 130 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
1 5 " 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 ■ 25 30 



Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 45 
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Pro Ser Ala Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 ^ 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Asp Leu Met Thr 
85 90 95 

Leu Ala Thr Trp Val Gly Val Asm Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 " 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 17 0 17 5 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 "* 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

Glu Ser Gin Cys 
210 



<210> 131 
<211> 183 
<212> PRT 
<213> Hepatitis 

<400> 131 
Met Asp He Asp 
1 

Ser Phe Leu Pro 
20 

Thr Ala Ala Ala 
35 

Ser Pro His His 
50 

Leu Met Thr Leu 
65 

Ser Arg Asp Leu 



lie Arg Gin Leu 
100 

Glu Thr Val Leu 
115 



B virus 



Pro Tyr Lys Glu 
5 

Ser Asp Phe Phe 



Leu Tyr Arg Glu 
40 

Thr Ala Leu Arg 
55 

Ala Thr Trp Val 
70 

Val Val Asn Tyr 
85 

Leu Trp Phe His 



Glu Tyr Leu Val 
120 



Phe Gly Ala Thr 
10 

Pro Ser Val Arg 

'•25 



Ala Leu Glu Ser 



Gin Ala He Leu 
60 

Gly Asn Asn Leu 
75 

Val Asn Thr Asn 
90 

He Ser Cys Leu 
105 

Ser Phe Gly Val 



Val Glu Leu Leu 
15 

Asp Leu Leu Asp 
30 

Pro Glu His Cys 
45 

Cys Trp Gly Glu 



Glu Asp Pro Ala 
80 

Met Gly Leu Lys 
95 

Thr Phe Gly Arg 
110 

Trp He Arg Thr 
125 
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Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 * 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 ~ ~ 155 " 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Glu Ser Gin Cys 
180 



<210> 132 
<211> 183 
<212> PRT 

<213> Hepatitis B virus 
<400> 132 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Gly Asn Leu Glu Asp Pro He 
65 70 ~ 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 ~ 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Cys Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Gly Ser Gin Cys 
180 

<210> 133 
<211> 3221 
<212> DNA 

<213> Hepatitis B virus 

<220> 
<221> CDS 

<222> (1901) . . (2458) 
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<400> 133 
ttccactgcc 


ttccaccaag 


CLCLgcdgyd 


t g gt g g c t c c 


ag l. Lcaggaa 


/ — t — . /-v 4- — i -a /-^ /—i r~\ 

Cdgi-aadCCL 


aatciccgcy 


aggacrgggg 


aCCCtlj LgaC 


aggacccctg 


/r|- n /~r 4— /-v- 4- 4— -ci /—i 

CLCgL.gLL.aC 


^gg^gggg u 


gcagagtcta 


gactcgtggt 


ggactt ct ct 


tggccaaaat 


tcgcagtccc 


caacct-ccaa 


tcctggttat 


cgctggatgt 


gtctgcggcg 


atgcctcatc 


4-4- „ 4- 4- _ 4- 4- 

ttc utattgg 


ttcttctgga 


aattccagga 


tcaacaacaa 


ecagtaeggg 


aggcaactct 


atgtttccct 


catigt L,gci.g 


tattcccatc 


ccatcgtcct 


gggctttege 


tttctcttgg 


ctcagtttac 


LSgL g.C C9T.L 


cgrtrggct:!: 


LlC agC td LdL 


ggatga L.g ug 


gagtcccttt 


atacegctgt 


4^ — * *-*\ 4— 4— 4— 4" 

taccaaiiLX l 


aacaaaacaa 


aaagatgggg 


ttattcccta 


ggaacattgc 


cacaggat ca 


4— -3 4— 4— iT 4— *3 /-I "3 

LdLLyLdCdd 


gttaacaggc 


✓"i 4~ *n 4— 4— y-^r •■a 4— 4— /"»r 

CLdLLgaLLg 


/■v 2 5 /"y4~ *3 4— r~< 4- 

ga.aagL.ci ug l 


gctccattta 


cacaatgtgg 


at at cctgcc 


aaacaggctt 


tcactttctc 


gccaacttac 


ct t t accccg 


ttgeteggea 


•n /-i /T 4~ /^x" r^r 4— 

dCygccLggL 


actggttggg 


gcLinggccaL. 


dygccdLCcig 


ccgat ccata 


ct geggaact 


cct agecget 


ctcatcggaa 


ct gacaattc 


4™ rt4" r^* 4~ r^t 4— /-t 

ngucg ucc lc 


ctaggctgta 


ct gecaactg 


gat cctt cgc 


ctgaat cccg 


cggacgaccc 


ct ctegggge 


ctgccgt t cc 


agccgaccac 


ggggcgcacc 


tctcatctgc 


cggtccgtgt 


gcacttcgct 


tgaacgccca 


tcagatcctg 


cccaaggtct 


tgtcaacgac 


cgaccttgag 


gcctacttca 


tgggggagga 


gattaggtta 


aaggtctttg 


gcgcaccagc 


accatgcaac 


tttttcacct 
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ccccagagtc 


aggggtctgt 


attttcctgc 


60 


tgctccgaat 


attgcctctc 


acatctcgtc 


120 


gaacatggag 


aacatcacat 


caggattcct 


180 


tttattgttg 


acaagaatcc 


tcacaatacc 


240 


caattttata 


gggggatcac 


ccgtgtgtct 


300 


tcactcacca 


acctcctgtc 


ctccaatttg 


360 


ttttatcata 


ttcctcttca 


tcctgctgct 


420 


ttatcaaggt 


atgttgcccg 


tttgtcctct 


480 


accatgeaaa 


acctgcacga 


ctcctgctca 


540 


tacaaaa'cet 


acggttggaa 


attgeacctg 


600 


aaaataccta 


tgggagtggg 


cctcagtccg 


660 


tgttcagtgg 


ttegtaggge 


tttcccccac 


720 


gtattggggg 


ccaagtctgt 


acagcatcgt 


780 


cttttgtctc 


tgggtataca 


tttaaaccct 


840 


aacttcatgg 


gttacataat 


tggaagttgg 


900 


aagatcaaac 


actgttttag 


aaaacttcct 


960 


caaagaattg 


tgggtctttt 


gggctttget 


1020 


ttaatgeett 


tgtatgcatg 


tatacaggct 


1080 


aaggecttte 


taagtaaaca 


gtacatgaac 


1140 


ctgtgccaag 


tgtttgctga 


cgcaaccccc 


1200 


cgcatgagtg 


gaacctttgt 


ggctcctctg 


1260 


tgtattgetc 


gcagccggtc 


tggagcaaag 


1320 


tegeggaaat 


atacategtt 


tccatggctg 


1380 


gggaegtect 


ttgtttacgt 


cccgtcggcg 


1440 


cgcttgggac 


tctatcgtcc 


ccttctccgt 


1500 


tctctttacg 


cggtctcccc 


gtctgtgcct 


1560 


tcacctctgc 


acgttgcatg 


gagaccaccg 


1620 


tacataagag 


gactcttgga 


ctcccagcaa 


1680 


aagactgtgt 


gtttaaggac 


tgggaggagc 


1740 


tattaggagg 


ctgtaggcat 


aaattggtct 


1800 


ctgcctaatc 


atctcttgta 


catgtcccac 


1860 
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tgttcaagcc tccaagctgt gccttgggtg gctttggggc atg gac att gac cct 1915 

Met Asp lie Asp Pro 
1 5 

tat aaa gaa ttt gga get act gtg gag tta etc teg ttt ttg cct tct 1963 
Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu Pro Ser 
10 15 20 

gac ttc ttt cct tec gtc aga gat etc eta gac ace gec tea get ctg 2011 
Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser Ala Leu 
25 30 35 

tat cga gaa gee tta gag tct cct gag cat tgc tea cct cac cat act 2059 
Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His Thr 
40 45 50 

gca etc agg caa gee att etc tgc tgg ggg gaa ttg atg act eta get 2107 
Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr Leu Ala 
55 60 65 

acc tgg gtg ggt aat aat ttg gaa gat cca gca tec agg gat eta gta 2155 
Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala Ser Arg Asp Leu Val 
70 75 80 85 

gtc aat tat gtt aat act aac atg ggt tta aag ate agg caa eta ttg 2203 
Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys lie Arg Gin Leu Leu 
90 95 100 

tgg ttt cat ata tct tgc ctt act ttt gga aga gag act gta ctt gaa 2251 
Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val Leu Glu 
105 HO 115 

tat ttg gtc tct ttc gga gtg tgg att cgc act cct cca gee tat aga 2299 
Tyr Leu Val Ser "Phe Gly Val Trp He Arg Thr Pro Pro Ala Tyr Arg 
120 125 130 

cca cca aat gee cct ate tta tea aca ctt ccg gaa act act gtt gtt 2347 
Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr Val Val 
135 140 145 

aga cga egg gac cga ggc agg tec cct aga aga aga act ccc teg cct 2395 
Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
150 " ~ 155 160 165 

cgc aga cgc aga tct caa teg ccg cgt cgc aga aga tct caa tct egg 2443 
Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
170 175 180 

gaa tct caa tgt tag tattccttgg actcataagg tgggaaactt tactgggctt 24 98 
Glu Ser Gin Cys 
185 

tattcctcta cagtacctat ctttaatcct gaatggcaaa ctccttcctt tcctaagatt 2558 

catttacaag aggacattat tgataggtgt caacaatttg tgggccctct cactgtaaat 2 618 

gaaaagagaa gattgaaatt aattatgect gctagattct atcctaccca cactaaatat 2678 

ttgeccttag acaaaggaat taaaccttat tatccagatc aggtagttaa tcattacttc 2738 

caaaccagac attatttaca tactctttgg aaggctggta ttetatataa gagggaaacc 2798 

acaegtageg catcattttg cgggtcacca tattcttggg aacaagagct acagcatggg 2858 
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3 ft fx\~ 4~ fx fx\~ /—i -a 


4-* *T"" 3 2 3 Q /""l f 4— /~1 

LLdaaaCCLC 


f~< rt-j a a ft ft f* a T~ 

gCaadygCdL 


fxfxfx nrra a +~ 


— , J_ X, ^-,4- J- 


f* fi z> c* f* f ^~ f\~ 


9 Q1 £ 

Z. -3 J- O 




CCCydLOdtC 


■3 /t 4~ 4~ fx fx z2i r~\ f 

dy LuygaouL 


cyud LLuyya 


y LLdaO L-Octa, 


ana afpnaaa 
clocia. LUCdyd 




ttgggacttc 


aaccccatca 


aggaccactg 


gccagcagcc 


aaccaggtag 


gagtgggagc 


3038 


attcgggcca 


gggctcaccc 


ctccacacgg 


cggtattttg 


gggtggagcc 


ctcaggctca 


3098 


gggcatattg 


accacagtgt 


caacaattcc 


tcctcctgcc 


tccaccaatc 


ggcagtcagg 


3158 


aaggcagcct 


actcccatct 


ctccacctct 


aagagacagt 


catcctcagg 


ccatgcagtg 


3218 


gaa 












3221 



<210> 134 
<211> 185 
<212> PRT 

<213> Hepatitis B virus 



<400> 134 



Met 


Asp 


He 


Asp 


Pro 


Tyr 


Lys 


Glu 


Phe 


Gly 


Ala 


Thr 


Val 


Glu 


Leu 


Leu 


1 








5 










10 










15 




Ser 


Phe 


Leu 


Pro 


Ser 


Asp 


Phe 


Phe 


Pro 


Ser 


Val 


Arg 


Asp 


Leu 


Leu 


Asp 








20 










25 










30 






Thr 


Ala 


Ser 


Ala 


Leu 


Tyr 


Arg 


Glu 


Ala 


Leu 


Glu 


Ser 


Pro 


Glu 


His 


Cys 






35 










40 










45 








Ser 


Pro 


His 


His 


Thr 


Ala 


Leu 


Arg 


Gin 


Ala 


He 


Leu 


Cys 


Trp 


Gly 


Glu 




50 










55 










60 










Leu 


Met 


Thr 


Leu 


Ala 


Thr 


Trp 


Val 


Gly 


Asn 


Asn 


Leu 


Glu 


Asp 


Pro 


Ala 


65 










70 








75 










80 


Ser 


Arg 


Asp 


Leu 


Val 


Val 


Asn 


Tyr 


Val 


Asn 


Thr 


Asn 


Met 


Gly 


Leu 


Lys 








85 










90 










95 




He 


Arg 


Gin 


Leu 


Leu 


Trp 


Phe 


His 


He 


Ser 


Cys 


Leu 


Thr 


Phe 


Gly 


Arg 








100 










105 










110 






Glu 


Thr 


Val 


Leu 


Glu 


Tyr 


Leu 


Val 


Ser 


Phe 


Gly Val 


Trp 


He 


Arg 


Thr 






115 










120 










125 








Pro 


Pro 


Ala 


Tyr 


Arg 


Pro 


Pro 


Asn 


Ala 


Pro 


He 


Leu 


Ser 


Thr 


Leu 


Pro 




130 










135 










140 










Glu 


Thr 


Thr 


Val 


Val 


Arg 


Arg 


Arg 


Asp 


Arg 


Gly Arg 


Ser 


Pro 


Arg 


Arg 


145 










150 










155 










160 


Arg 


Thr 


Pro 


Ser 


Pro 


Arg 


Arg 


Arg 


Arg 


Ser 


Gin 


Ser 


Pro 


Arg 


Arg 


Arg 










165 










170 










175 




Arg 


Ser 


Gin 


Ser 


Arg 


Glu 


Ser 


Gin 


Cys 

















180 185 



<210> 135 
<2H> 188 
<212> PRT 

<213> Woodchuck hepatitis B virus 
<400> 135 

Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ser Ser Tyr Gin Leu Leu 
1 ~ 5 10 15 

Asn Phe Leu Pro Leu Asp Phe Phe Pro Asp Leu Asn Ala Leu Val Asp 
2 0 25 30 



Thr Ala Thr Ala Leu Tyr Glu Glu Glu Leu Thr Gly Arg Glu His Cys 
35 " 40 45 
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Ser Pro His His Thr Ala lie Arg Gin Ala Leu Val Cys Trp Asp Glu 
50 55 60 

Leu Thr Lys Leu lie Ala Trp Met Ser Ser Asn lie Thr Ser Glu Gin 
65 70 75 80 

Val Arg Thr lie lie Val Asn His Val Asn Asp Thr Trp Gly Leu Lys 
85 90 95 

Val Arg Gin Ser Leu Trp Phe His Leu Ser Cys Leu Thr Phe Gly Gin 
100 105 110 

His Thr Val Gin Glu Phe Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Ala Pro Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 ^ 135 140 

Glu His Thr Val lie Arg Arg Arg Gly Gly Ala Arg Ala Ser Arg Ser 
145 150 155 160 

Pro Arg Arg Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro 
165 170 175 

Arg Arg Arg Arg Ser Gin Ser Pro Ser Thr Asn Cys 
180 18 5 



<210> 136 
<211> 217 
<212> PRT 

<213> Ground squirrel hepatitis virus 
<400> 136 

Met Tyr Leu Phe His Leu Cys Leu Val Phe Ala Cys Val Pro Cys Pro 
15 10 15 

Thr Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Asp Met Asp 
20 ~ 25 30 

lie Asp Pro Tyr Lys Glu Phe Gly Ser Ser Tyr Gin Leu Leu Asn Phe 
35 4 0 4 5 

Leu Pro Leu Asp Phe Phe Pro Asp Leu Asn Ala Leu Val Asp Thr Ala 
50 ~ 55 60 

Ala Ala Leu Tyr Glu Glu Glu Leu Thr Gly Arg Glu His Cys Ser Pro 
65 ~ 70 75 80 

His His Thr Ala He Arg Gin Ala Leu Val Cys Trp Glu Glu Leu Thr 
85 90 , 95 

Arg Leu He Thr Trp Met Ser Glu Asn Thr Thr Glu Glu Val Arg Arg 
100 ~ 105 110 

He He Val Asp His Val Asn Asn Thr Trp Gly Leu Lys Val Arg Gin 
115 120 125 

Thr Leu Trp Phe His Leu Ser Cys Leu Thr Phe Gly Gin His Thr Val 
130 135 140 

Gin Glu Phe Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Ala Pro 
145 150 155 ~ 160 
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Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu His Thr 
165 170 175 

Val lie Arg Arg Arg Gly Gly Ser Arg Ala Ala Arg Ser Pro Arg Arg 
180 " 185 190 

Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg 
195 " 200 205 

Arg Ser Gin Ser Pro Ala Ser Asn Cys 



<210> 137 
<211> 262 
<212> PRT 

<213> Snow Goose Hepatitis B Virus 
<400> 137 

Met Asp Val Asn Ala Ser Arg Ala Leu Ala Asn Val Tyr Asp Leu Pro 
15 10 15 

Asp Asp Phe Phe Pro Lys He Glu Asp Leu Val Arg Asp Ala Lys Asp 
20 25 30 

Ala Leu Glu Pro Tyr Trp Lys Ser Asp Ser lie Lys Lys His Val Leu 
35 ~ ~ 40 45 

He Ala Thr His Phe Val Asp Leu He Glu Asp Phe Trp Gin Thr Thr 
50 55 60 

Gin Gly Met His Glu He Ala Glu Ala He Arg Ala- Val He Pro Pro 
65 " 70 75 80 

Thr Thr Ala Pro Val Pro Ser Gly Tyr Leu He Gin His Asp Glu Ala 
85 90 95 

Glu Glu He Pro Leu Gly Asp Leu Phe Lys Glu Gin Glu Glu Arg He 
100 ' 105^ 110 

Val Ser Phe Gin Pro Asp Tyr Pro He Thr Ala Arg He His Ala His 
115 120 125 

Leu Lys Ala Tyr Ala Lys He Asn Glu Glu Ser Leu Asp Arg Ala Arg 
130 ~ ~ 135 140 

Arg Leu Leu Trp Trp His Tyr Asn Cys Leu Leu Trp Gly Glu Ala Thr 
145 150 155 160 

Val Thr Asn Tyr He Ser Arg Leu Arg Thr Trp Leu Ser Thr Pro Glu 
165 170 175 

Lys Tyr Arg Gly Arg Asp Ala Pro Thr He Glu Ala He Thr Arg Pro 
180 185 190 

He Gin Val Ala Gin Gly Gly Arg Lys Thr Ser Thr Ala Thr Arg Lys 
195 200 205 

Pro Arg Gly Leu Glu Pro Arg Arg Arg Lys Val Lys Thr Thr Val Val 
210 215 220 

Tyr Gly Arg Arg Arg Ser Lys Ser Arg Glu Arg Arg Ala Ser Ser Pro 



210 



215 



225 



230 



235 



240 
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Gln Arg Ala Gly Ser Pro Leu Pro Arg Ser Ser Ser Ser His His Arg 
245 250 255 

Ser Pro Ser Pro Arg Lys 
260 



<210> 138 

<211> 305 

<212> PRT 

<213> Duck hepatitis B virus 

<40'0> 138 

Met Trp Asp Leu Arg Leu His Pro Ser Pro Phe Gly Ala Ala Cys Gin 

1 ~ 1 5 10 15 

Gly lie Phe Thr Ser Ser Leu Leu Leu Phe Leu Val Thr Val Pro Leu 
20 25 30 

Val Cys Thr lie Val Tyr Asp Ser Cys Leu Cys Met Asp He Asn Ala 
35 * 40 45 

Ser Arg Ala Leu Ala Asn Val Tyr Asp Leu Pro Asp Asp Phe Phe Pro 
50 55 60 

Lys He Asp Asp Leu Val Arg Asp Ala Lys Asp Ala Leu Glu Pro Tyr 
65 ~ 70 75 80 

Trp Arg Asn Asp Ser He Lys Lys His Val Leu He Ala Thr His Phe 
85 90 95 

Val Asp Leu He Glu Asp Phe Trp Gin Thr Thr Gin Gly Met His Glu 
100 " 105 110 

lie Ala Glu Ala Leu Arg Ala He He Pro Ala Thr Thr Ala Pro Val 
115 120 125 

Pro Gin Gly Phe Leu Val Gin His Glu Glu Ala Glu Glu He Pro Leu 
130 "* 135 140 

Gly Glu Leu Phe Arg Tyr Gin Glu Glu Arg Leu Thr Asn Phe Gin Pro 
145 150 155 160 

Asp Tyr Pro Val Thr Ala Arg He His Ala His Leu Lys Ala Tyr Ala 
165 170 175 

Lys He Asn Glu Glu Ser Leu Asp Arg Ala Arg Arg Leu Leu Trp Trp 
180 185 190 

His Tyr Asn Cys Leu Leu Trp Gly Glu Pro Asn Val Thr Asn Tyr He 
195 200 205 

Ser Arg Leu Arg Thr Trp Leu Ser Thr Pro Glu Lys Tyr Arg Gly Lys 
210 ~ 215 220 

Asp Ala Pro Thr He Glu Ala He Thr Arg Pro He Gin Val Ala Gin 
225 230 235 240 

Gly Gly Arg Asn Lys Thr Gin Gly Val Arg Lys Ser Arg Gly Leu Glu 
245 250 255 

Pro Arg Arg Arg Arg Val Lys Thr Thr He Val Tyr Gly Arg Arg Arg 
2 60 2 65 27 0 
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Ser Lys Ser Arg Glu Arg Arg Ala Pro Thr Pro Gin Arg Ala Gly Ser 
275 " 280 285 

Pro Leu Pro Arg Thr Ser Arg Asp His His Arg Ser Pro Ser Pro Arg 
290 295 300 

Glu 
305 



<210> 139 
<211> 212 
<212> PRT 

<213> Haemophilus influenzae 
<400> 139 

Met Lys Lys Thr Leu Leu Gly Ser Leu lie Leu Leu Ala Phe Ala Gly 
15 10 15 

Asn Val Gin Ala Ala Ala Asn Ala Asp Thr Ser Gly Thr Val Thr Phe 
20 25 30 

Phe Gly Lys Val Val Glu Asn Thr Cys Gin Val Asn Gin Asp Ser Glu 
35 40 45 

Tyr Glu Cys Asn Leu Asn Asp Val Gly Lys Asn His Leu Ser Gin Gin 
50 55 " 60 

Gly Tyr Thr Ala Met Gin Thr Pro Phe Thr lie Thr Leu Glu Asn Cys 
65 70 75 80 

Asn Val Thr Thr Thr Asn Asn Lys Pro Lys Ala Thr Lys Val Gly Val 
85 90 95 

Tyr Phe Tyr Ser Trp Glu lie Ala Asp Lys Asp Asn Lys Tyr Thr Leu 
100 105 110 

Lys Asn lie Lys Glu Asn Thr Gly Thr Asn Asp Ser Ala Asn Lys Val 
115 120 125 

Asn lie Gin Leu Leu Glu Asp Asn Gly Thr Ala Glu lie Lys Val Val 
130 135 140 

Gly Lys Thr Thr Thr Asp Phe Thr Ser Glu Asn His Asn Gly Ala Gly 
145 150 155 160 

Ala Asp Pro Val Ala Thr Asn Lys His He Ser Ser Leu Thr Pro Leu 
165 170 175 

Asn Asn Gin Asn Ser He Asn Leu His Tyr He Ala Gin Tyr Tyr Ala 
180 185 190. 

Thr Gly Val Ala Glu Ala Gly Lys Val Pro Ser Ser Val Asn Ser Gin 
195 200 205 

He Ala Tyr Glu 
210 



<210> 140 
<2H> 139 
<212> PRT 

<213> Pseudomonas stutzeri 
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<400> 140 

Met Lys Ala Gin Met Gin Lys Gly Phe Thr Leu lie Glu Leu Met lie 
15 10 15 

Val Val Ala lie lie Gly lie Leu Ala Ala He Ala Leu Pro Ala Tyr 
20 25 30 

Gin Asp Tyr Thr Val Arg Ser Asn Ala Ala Ala Ala Leu Ala Glu He 
35 . 40 45 

Thr Pro Gly Lys He Gly Phe Glu Gin Ala He Asn Glu Gly Lys Thr 
50 ' 55 60 

Pro Ser Leu Thr Ser Thr Asp Glu Gly Tyr He Gly He Thr Asp Ser 
65 70 75 80 

Thr Ser Tyr Cys Asp Val Asp Leu Asp Thr Ala Ala Asp Gly His He 
"85 90 95 

Glu Cys Thr Ala Lys Gly Gly Asn Ala Gly Lys Phe Asp Gly Lys Thr 
100 105 110 

He Thr Leu Asn Arg Thr Ala Asp Gly Glu Trp Ser Cys Ala Ser Thr 
115 120 125 

Leu Asp Ala Lys Tyr Lys Pro Gly Lys Cys Ser 
130 135 



<210> 141 
<211> 59 
<212> PRT 

<213> Caulobacter crescentus 
<400> 141 

Met Thr Lys Phe Val Thr Arg Phe Leu Lys Asp Glu Ser Gly Ala Thr 
15 10 15 

Ala He Glu Tyr Gly Leu He Val Ala Leu lie Ala Val Val He Val 
20 25 30 

Thr Ala Val Thr Thr Leu Gly Thr Asn Leu Arg Thr Ala Phe Thr Lys 
35 40 45 

Ala Gly Ala Ala Val Ser Thr Ala Ala Gly Thr 
50 55 



<210> 142 
<2H> 173 
<212> PRT 

<213> Escherichia coli 
<400> 142 

Met Ala Val Val Ser Phe Gly Val Asn Ala Ala Pro Thr He Pro Gin 
15 10 15 

Gly Gin Gly Lys Val Thr Phe Asn Gly Thr Val Val Asp Ala Pro Cys 
20 25 30 

Ser He Ser Gin Lys Ser Ala Asp Gin Ser He Asp Phe Gly Gin Leu 
35 40 4 5 
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Ser Lys Ser Phe Leu Glu Ala Gly Gly Val Ser Lys Pro Met Asp Leu 
50 55 60 

Asp lie Glu Leu Val Asn Cys Asp lie Thr Ala Phe Lys Gly Gly Asn 
65 70 75 80 

Gly Ala Gin Lys Gly Thr Val Lys Leu Ala Phe Thr Gly Pro lie Val 
85 90 95 

Asn Gly His Ser Asp Glu Leu Asp Thr Asn Gly Gly Thr Gly Thr Ala 
100 ~ 105 110 

lie Val Val Gin Gly Ala Gly Lys Asn Val Val Phe Asp Gly Ser Glu 
115 120 125 

Gly Asp Ala Asn Thr Leu Lys Asp Gly Glu Asn Val Leu His Tyr Thr 
130 135 140 

Ala Val Val Lys Lys Ser Ser Ala Val Gly Ala Ala Val Thr Glu Gly 
145 150 155 160 

Ala Phe Ser Ala Val Ala Asn Phe Asn Leu Thr Tyr Gin 
165 170 



<210> 143 
<211> 173 
<212> PRT 

<213> Escherichia coli 
<400> 143 

Met Ala Val Val Ser Phe Gly Val Asn Ala Ala Pro Thr lie Pro Gin 
15 10 15 

Gly Gin Gly Lys Val Thr Phe Asn Gly Thr Val Val Asp Ala Pro Cys 
20 25 30 

Ser lie Ser Gin Lys Ser Ala Asp Gin Ser lie Asp Phe Gly Gin Leu 
35 40 45 

Ser Lys Ser Phe Leu Glu Ala Gly Gly Val Ser Lys Pro Met Asp Leu 
50 55 60 

Asp He Glu Leu Val Asn Cys Asp He Thr Ala Phe Lys Gly Gly Asn 
65 70 75 80 

Gly Ala Gin Lys Gly Thr Val Lys Leu Ala Phe Thr Gly Pro He Val 
85 90 95 

Asn Gly His Ser Asp Glu Leu Asp Thr Asn Gly Gly Thr Gly Thr Ala 
100 105 110 

He Val Val Gin Gly Ala Gly Lys Asn Val Val Phe Asp Gly Ser Glu 
115 120 125 

Gly Asp Ala Asn Thr Leu Lys Asp Gly Glu Asn Val Leu His Tyr Thr 
130 135 140 

Ala Val Val Lys Lys Ser Ser Ala Val Gly Ala Ala Val Thr Glu Gly 
145 150 155 160 

Ala Phe Ser Ala Val Ala Asn Phe Asn Leu Thr Tyr Gin 
165 170 
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<210> 144 
<211> 172 
<212> PRT 

<213> Escherichia coli 
<400> 144 

Met Ala Val Val Ser Phe Gly Val Asn Ala Ala Pro Thr Thr Pro Gin 
15 10 15 

Gly Gin Gly Arg Val Thr Phe Asn Gly Thr Val Val Asp Ala Pro Cys 
20 25 30 

Ser lie Ser Gin Lys Ser Ala Asp Gin Ser lie Asp Phe Gly Gin Leu 
35 40 45 

Ser Lys Ser Phe Leu Ala Asn Asp Gly Gin Ser Lys Pro Met Asn Leu 
50 55 60 

Asp lie Glu Leu Val Asn Cys Asp lie Thr Ala Phe Lys Asn Gly Asn 
65 70 75 80 

Ala Lys Thr Gly Ser Val Lys Leu Ala Phe Thr Gly Pro Thr Val Ser 
85 90 95 

Gly His Pro Ser Glu Leu Ala Thr Asn Gly Gly Pro Gly Thr Ala lie 
100 105 110 

Met He' Gin Ala Ala Gly Lys Asn Val Pro Phe Asp Gly Thr Glu Gly 
115 120 125 

Asp Pro Asn Leu Leu Lys Asp Gly Asp Asn Val Leu His Tyr Thr Thr 
130 135 140 

Val Gly Lys Lys Ser Ser Asp Gly Asn Ala Gin He Thr Glu Gly Ala 
145 ~ 150 155 160 

Phe Ser Gly Val Ala Thr Phe Asn Leu Ser Tyr Gin 
165 170 



<210> 145 
<211> 853 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (281) . . (829) 

<400> 145 

acgtttctgt ggctcgacgc atcttcctca ttcttctctc caaaaaccac ctcatgcaat 60 

ataaacatct ataaataaag ataacaaata gaatattaag ccaacaaata aactgaaaaa 120 

gtttgtccgc gatgctttac ctctatgagt caaaatggcc ccaatgtttc atcttttggg 180 

ggaaactgtg cagtgttggc agtcaaactc gttgacaaac aaagtgtaca gaacgactgc 240 

ccatgtcgat ttagaaatag ttttttgaaa ggaaagcagc atg aaa att aaa act 295 

Met Lys He Lys Thr 
1 5 

ctg gca ate gtt gtt ctg teg get ctg tec etc agt tct acg acg get 343 
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Leu Ala lie Val Val Leu Ser Ala Leu Ser Leu Ser Ser Thr Thr Ala 
10 15 20 

ctg gcc get gec acg acg gtt aat ggt ggg acc gtt cac ttt aaa ggg 391 
Leu Ala Ala Ala Thr Thr Val Asn Gly Gly Thr Val His Phe Lys Gly 
25 30 35 

gaa gtt gtt aac gcc get tgc gca gtt gat gca ggc tct gtt gat caa 439 
Glu Val Val Asn Ala Ala Cys Ala Val Asp Ala Gly Ser Val Asp Gin 
40 45 50 

acc gtt cag tta gga cag gtt cgt acc gca teg ctg gca cag gaa gga 487 
Thr Val Gin Leu Gly Gin Val Arg Thr Ala Ser Leu Ala Gin Glu Gly 
55 60 65 

gca acc agt tct get gtc ggt ttt aac att cag ctg aat gat tgc gat 535 
Ala Thr Ser Ser Ala Val Gly Phe Asn lie Gin Leu Asn Asp Cys Asp 
70 75 80 85 

acc aat gtt gca tct aaa gcc get gtt gcc ttt tta ggt acg gcg att 583 
Thr Asn Val Ala Ser Lys Ala Ala Val Ala Phe Leu Gly Thr Ala He 
90 95 100 

gat gcg ggt cat acc aac gtt ctg get ctg cag agt tea get gcg ggt 631 
Asp Ala Gly His Thr Asn Val Leu Ala Leu Gin Ser Ser Ala Ala Gly 
105 110 115 

age gca aca aac gtt ggt gtg cag ate ctg gac aga acg ggt get gcg 67 9 
Ser Ala Thr Asn Val Gly Val Gin He Leu Asp Arg Thr Gly Ala Ala 
120 125 130 

ctg acg ctg gat ggt gcg aca ttt agt tea gaa aca acc ctg aat aac 727 
Leu Thr Leu Asp Gly Ala Thr Phe Ser Ser Glu Thr Thr Leu Asn Asn 
135 140 145 

gga acc aat acc att ccg ttc cag gcg cgt tat ttt gca acc ggg gcc 775 
Gly Thr Asn Thr He Pro Phe Gin Ala Arg Tyr Phe Ala Thr Gly Ala 
150 155 160 165 

gca acc ccg ggt get get aat gcg gat gcg acc ttc aag gtt cag tat 823 
Ala Thr Pro Gly Ala Ala Asn Ala Asp Ala Thr Phe Lys Val Gin Tyr 
170 175 180 

caa taa cctacctagg ttcagggacg ttca 853 
Gin 

* 

<210> 146 
<211> 182 
<212> PRT 

<213> Escherichia coli 
<400> 146 

Met Lys He Lys Thr Leu Ala He Val Val Leu Ser Ala Leu Ser Leu 

15 10 15 

Ser Ser Thr Thr Ala Leu Ala Ala Ala Thr Thr Val Asn Gly Gly Thr 

20 25 30 

Val His Phe Lys Gly Glu Val Val Asn Ala Ala Cys Ala Val Asp Ala 

35 40 ' * 45 

Gly Ser Val Asp Gin Thr Val Gin Leu Gly Gin Val Arg Thr Ala Ser 

50 55 60 

Leu Ala Gin Glu Gly Ala Thr Ser Ser Ala Val Gly Phe Asn He Gin 
65 70 75 80 
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Leu 


Asn 


Asp 


Cys 


Asp 


Thr 


Asn 


Val 


Ala 


Ser 


Lys 


Ala 


Ala 


Val 


Ala 


Phe 








85 










90 










95 




Leu 


Gly 


Thr 


Ala 


He 


Asp 


Ala 


Gly 


His 


Thr 


Asn 


Val 


Leu 


Ala 


Leu 


Gin 






100 










105 










110 






Ser 


Ser 


TV "1 

Ala 


7\ 1 

Ala 


Gly 


Ser 


Ala 


Thr 


Asn 


Val 


Gly Val 


Cain 


lie 


.Leu 


Asp 






115 








120 










125 








Arg 


Thr 


Gly 


Ala 


Ala 


Leu 


Thr 


Leu 


Asp 


Gly 


Ala 


Thr 


Phe 


Ser 


Ser 


Glu 




130 










135 










140 










Thr 


Thr 


Leu 


Asn 


Asn 


Gly 


Thr 


Asn 


Thr 


He 


Pro 


Phe 


Gin 


Ala 


Arg 


Tyr 


145 










150 










155 










160 


Phe 


Ala 


Thr 


Gly 


Ala 


Ala 


Thr 


Pro 


Gly Ala 


Ala 


Asn 


Ala 


Asp 


Ala 


Thr 








165 










170 










175 




Phe 


Lys 


Val 


Gin 


Tyr 


Gin 























180 



<210> 147 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> FLAG peptide 
<400> 147 

Cys Gly Gly Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 10 



<210> 148 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 148 

ccggaattca tggacattga cccttataaa g 31 



<210> 149 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 149 

gtgcagtatg gtgaggtgag gaatgctcag gagactc 37 



<210> 150 
<2H> 37 
<212> DNA 
<213> Artificial 

<220> 

<223> primer 



Sequence 



<400> 150 
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gsgtctcctg agcattcctc acctcaccat actgcac 37 

<210> 151 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 151 

cttccaaaag tgagggaaga aatgtgaaac cac 33 

<210> 152 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 152 

cgcgtcccaa gcttctaaac aacagtagtc tccggaagcg ttgatag 47 

<210> 153 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 153 

gtggtttcac atttcttccc tcacttttgg aag 33 

<210> 154 
<211> 281 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 154 

Met Ser Glu Tyr Gin Pro Ser Leu Phe Ala Leu Asn Pro Met Gly Phe 
15 10 15 

Ser Pro Leu Asp Gly Ser Lys Ser Thr Asn Glu Asn Val Ser Ala Ser 
20 25 30 

Thr Ser Thr Ala Lys Pro Met Val Gly Gin Leu lie Phe Asp Lys Phe 
35 40 45 

lie Lys Thr Glu Glu Asp Pro lie lie Lys Gin Asp Thr Pro Ser Asn 
50 55 60 

Leu Asp Phe Asp Phe Ala Leu Pro Gin Thr Ala Thr Ala Pro Asp Ala 
65 70 75 80 

Lys Thr Val Leu Pro lie Pro Glu Leu Asp Asp Ala Val Val Glu Ser 
85 90 95 
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Phe Phe Ser Ser Ser Thr Asp Ser Thr Pro Met Phe Glu Tyr Glu Asn 
100 105 110 

Leu Glu Asp Asn Ser Lys Glu Trp Thr Ser Leu Phe Asp Asn Asp lie 
115 120 125 

Pro Val Thr Thr Asp Asp Val Ser Leu Ala Asp Lys Ala He Glu Ser 
130 135 140 

Thr Glu Glu Val Ser Leu Val Pro Ser Asn Leu Glu Val Ser Thr Thr 
145 150 155 160 

Ser Phe Leu Pro Thr Pro Val Leu Glu Asp Ala Lys Leu Thr Gin Thr 
165 170 175 

Arg Lys Val Lys Lys Pro Asn Ser Val Val Lys Lys Ser His His Val 
180 185 190 

Gly Lys Asp Asp Glu Ser Arg Leu Asp His Leu Gly Val Val Ala Tyr 
195 200 205 

Asn Arg Lys Gin Arg Ser He Pro Leu Ser Pro He Val Pro Glu Ser 
210 " ~ 215 220 

Ser Asp Pro Ala Ala Leu Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg 
225 230 235 240 

Arg Ser Arg Ala Arg Lys Leu Gin Arg Met Lys Gin Leu Glu Asp Lys 
245 250 255 

Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu Asn Glu Val Ala 
260 265 270 

Arg Leu Lys Lys Leu Val Gly Glu Arg 
275 ~ 280 

<210> 155 
<211> 181 
<212> PRT 

<213> Escherichia coli 
<400> 155 

Met Lys He Lys Thr Leu Ala He Val Val Leu Ser Ala Leu Ser Leu 
1 5 10 15 

Ser Ser Thr Ala Ala Leu Ala Ala Ala Thr Thr Val Asn Gly Gly Thr 
20 25 30 

Val His Phe Lys Gly Glu Val Val Asn Ala Ala Cys Ala Val Asp Ala 
35 40 45 

Gly Ser Val Asp Gin Thr Val Gin Leu Gly Gin Val Arg Thr Ala Ser 
50 55 60 

Leu Ala Gin Glu Gly Ala Thr Ser Ser Ala Val Gly Phe Asn He Gin 
65 70 75 80 

Leu Asn Asp Cys Asp Thr Asn Val Ala Ser Lys Ala Ala Val Ala Phe 
85 90 95 

Leu Gly Thr Ala He Asp Ala Gly His Thr Asn Val Leu Ala Leu Gin 
100 105 HO 
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'Ser Ser Ala Ala Gly Ser Ala Thr Asn Val Gly Val Gin lie Leu Asp 
115 120 125 

Arg Thr Gly Ala Ala Leu Thr Leu Asp Gly Ala Thr Phe Ser Ser Glu 
130 135 140 

Thr Thr Leu Asn Asn Gly Thr Asn Thr lie Pro Phe Gin Ala Arg Tyr 

145 150 155 160 

Phe Ala Gly Ala Ala Thr Pro Gly Ala Ala Asn Ala Asp Ala Thr Phe 
165 170 175 



Lys Val Gin Tyr Gin 
180 



<210> 156 
<211> 447 
<212> DNA 
<213> Hepatitis B 

<220> 

<221> CDS 

<222> (1) . . (447) 

<400> 156 

atg gac att gac cct tat aaa gaa ttt gga get act gtg gag tta etc 48 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 

1 5 10 15 

teg ttt ttg cct tct gac ttc ttt cct tec gta cga gat ctt eta gat 9 6 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

acc gec gca get ctg tat egg gat gec tta gag tct cct gag cat tgt 144 
Thr Ala Ala Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

tea cct cac cat act gca etc agg caa gca att ctt tgc tgg gga gac 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp 
50 55 60 

tta atg act eta get acc tgg gtg ggt act aat tta gaa gat cca gca 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Thr Asn Leu Glu Asp Pro Ala 
65 70 75 80 

tct agg gac eta gta gtc agt tat gtc aac act aat gtg ggc eta aag 288 
Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Val Gly Leu Lys 
85 90 95 

ttc aga caa tta ttg tgg ttt cac att tct tgt etc act ttt gga aga 336 
Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

gaa acg gtt eta gag tat ttg gtc tct ttt gga gtg tgg att cgc act 384 
Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

cct cca gee tat aga cca cca aat gec cct ate eta tea acg ctt ccg 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 



gag act act gtt gtt 



447 
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Glu Thr Thr Val Val 
145 



<210> 157 
<211> 149 
<212> PRT 
<213> Hepatitis B 

<400> 157 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ala Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys 
35 " 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Thr Asn Leu Glu Asp Pro Ala 
65 70 " 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Val Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Thr Val Val 
145 



<210> 158 
<211> 152 
<212> PRT 

<213> Hepatitis B ; 
<400> 158 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ala Ala Leu Tyr Arg Asp Ala Leu Glu Ser Pro Glu His Cys 
35 4 0 4 5 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Asp 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Thr Asn Leu Glu Asp Gly Gly 
65 70 75 80 
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Lys Gly Gly 



Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Val 
85 90 95 



Gly Leu Lys 



Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr 
100 105 110 



Phe Gly Arg 
115 



Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp 
120 125 



lie Arg Thr 
130 



Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser 
135 140 



Thr Leu Pro 
145 



Glu Thr Thr Val Val 
150 



<210> 159 

<211> 56 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 159 

tagatgatta cgccaagctt ataatagaaa tagttttttg aaaggaaagc agcatg 56 



<210> 160 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 160 

gtcaaaggcc ttgtcgacgt tattccatta cgcccgtcat tttgg 45 



<210> 161 

<211> 4623 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pFIMAIC 



<400> 161 

agacgaaagg gcctcgtgat acgcctattt ' ttataggtta atgtcatgat aataatggtt 60 

tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt 120 

ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa 180 

taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt 240 
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1 ft fiO 

X O D U 


tgctggcctt 


ttgctcacat 


gttctttcct 


gcgttatccc 


ctgattctgt 


ggataaccgt 


1920 


attaccgcct 


ttgagtgagc 


tgataccgct 


cgccgcagcc 


gaacgaccga 


gcgcagcgag 


1980 


tcagtgagcg 


aggaagcgga 


agagcgccca 


atacgcaaac 


cgcctctccc 


cgcgcgttgg 


2040 


ccgattcatt 


aatgcagctg 


gcacgacagg 


tttcccgact 


ggaaagcggg 


cagtgagcgc 


2100 


aacgcaatta 


atgtgagtta 


gctcactcat 


taggcacccc 


aggctttaca 


ctttatgctt 


2160 
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ccppptppta 


tcrtt crtcrtaa 


aattataaar 


ggataacaat 


ttcacacagg 


aaacagctat 


2220 


y L< U CL L y CL L L 


araccaaact 

Cl V-^ y C» t — L \^ C 


tataataaaa 


atacrtttttt 


gaaaggaaag 


cagcat gaaa 


2280 


at'haaaap'tp 1 

Cl L L Ci Cl Cl Cl U l u 


t p pp 3 a i~ crrt" 

luu l< gl a. l uy L 


t" rri - 1" pt" pt" pp 

cy c uu l.u c v> y 


PPtptptpPP 


tcaattctar 


appppptptp 

ci y u y y u l u l y 


2340 


ppprrp , trTP , r , a 
y l> u y v — i.y uoq 


prra pppt t a a 

u y ci ^ y ^ l luu 


1~CTPi~CTCTPapp 

c y y c y ci 


pt tcacttt" a 


aaacraaaaat 


tpttaappGG 


2400 


ppi~'t~rrp , pp , aPT 

y v*» l l y L' \-A v^- a. y 


ttpatppapp 

C- C- y Ci v-" Ci. ^ y 


pt c tcrtt era t 


caaaccatt* c 


agtt aggaca 


ggt t egtace 


24 60 


dp 1 3 1~ r , rrr~ , ")~rTrT 
y ^— - ci l uy l» *— y y 


Ci O C^ V-J Ci Ci Vwj V-J 


agcaaccagt 


tetgetgteg 


gttttaacat 


tcagctgaat 


2520 


pattpppata 

y ci l. l. y ci i— ci 


ppaatpttpp 

U» L> Ct C — i. \ — C I — \^ c* 


atetaaagee 


get gtt gect 


t tt t aggt ac 


ggegat tgat 


2580 


ffr 1 PTPrppr e Pi "P 
y uyy y Lua l ct 


p'p'aaPTT"} - "hp 1 ") - 
ci ci ci u y i_ l u l 


ppptptppsp 

y y ^ ■ c > — > c y v^ci y 


ci y l l u cl y u l y 


pppptapppp 

l^ y y y l ci y l» y 


33naaaprftt 

Cl CL L' CL Cl CL U y L I — 


2640 


y y l y uy uay a 


4- r^ p* "t~ ptptpi p*a pt 
lll Ly y auay 


a a P'Pfpfprtppt 
a. o. uy y y uyL«L 


CfC , c^c'\~^PiCc^c , 

y uy u l y ci u y u 


tppatpptpp 

l y y ci l y y l y u 


rr a pa 1 1 1 apt 

y CL U CL L L l ciy L 


2700 


LUcty aaaUaa 


P , P , P , i~PT^^"t _ ^P5 
U U U Ly QQ L.CICI 


p* ptpt a a nraaf" 
y y ci cl ' ' a. ci c 


appattpppt 

Cl L< U Cl L L U L» y L 


tppa PPPPPP 
i — u u cl y y u y uy 


ttattttcrca 

L LCL L LL L.y v Cl 


2760 




r^a 3 p* p* P" p 1 pr pt pt 
LdciLLLLyyy 


tpfp'tprp't^pt 
LyL Ly L> Laa i— 


PTP 1 PTPTa 1~ PTPTTa 
yuyycL l y u y cl 


_ _4_ j_ _ _ _ /-v-/~i-4- 
U U L L U CL ci y y L 


tp'aprtatpaa 

LUCiy LCILUCLCL 


2820 


f-=3 "i - 3 p 1 p 1 p 1 


a nrpi — I - p> ^ prprpr 
ct y y l n — a. y y y 


u. uy LL>a L. LGL- 


crcrrr<~'P\ ("tptpt^i "T 
yyyuciyyycLL 


rrpppapppt't' 

y U V — UCl U U U L L 


pt ppnataaa 

y l y uy ci l cl uu 


2880 


a. a. Lciciuy a l y 


;=};a;=5;=)PrPTaaPa 
Ct CT- o. ci y y ci ci y ci 


aattatttct 

y cl ' — » — cl i— c U- w c 


attappptpp 


1 1 ppt pppaa 


tgtttgctct 


2940 


yy uoyy dda l 


?1 Pi R t" Pf PIP3 a 1~ 3 
ci ci ci L.y y a. cl i— ci 


<s cl y c c y w u 


cnnccicsPiPiFilr 

v-j y a. Cri a. i_ 


atppaatttp 


appppptcat 

cl y y y v — ' v-H c c* i^l c 


3000 


j_ ^ a_ j_ _ pf pr;=s 

l ci l Ly uy y aa 


apf t pt c f crrrpt 
a. k i_ i_ y ^y y < - 1 


t t PriaPPPCfCT 
c c y ci ci y c l y y 


t'Cfataaacaa 

\— C_-i l_- t-A C-J- CJ, 


atcraccrcrtca 

t>L V— t^L y y L— LA. 


at at ggggca 


3060 


^i;p"rp i ;=irTP';=iPT*r~ 
ctct l u a. y u a. y l 


33P , p'PTPTi ^_ ■^~'^*p , 

cl c*- w *w y y l l 


Pt t CfPPCft t PP 
cicyuyy ccyy 


nrra acfRi"R^"^ , 

V-j KJ QL C4. v-J CA V-H O 


^ Ci V-J ^ C *wj 


cttttcrttat 


3120 


l u a. l l LuL»y y 


y ci ci c y U- ci y ^ cl 


c p p t p pt pap 
*— * y y y y — y y 


t paappt pta 

c y cl ci l> y c y c cl 


ppt ptppppt 


tt caepptat 

c c c* cl l> y y y k — 


3180 


err err rta f" pt rr t 
3 □ Ly y l 


3 o q p a ir pprfPj 

CL CL Cl Cl Cl l_ V — . V_J V_J 


Pi t pt rrrt" 1 1 c 

u LU L L L L* 


ppt pppa pap 

y l y y y cl y cl y 


pppppa pppa 

y y y l> cl y y y ci 


tapppapcaa 

c cl y u u cl ccu.w 


3240 


Let l Ly yoy l cl 


y^y l Ly t_ i_ u. y 


a t pa t pa a pp 

cl l y en l y ci ci y y 


aaapptppta 

Cl Cl Cl L» L» L U y L Cl 


pppattaatp 

V— » ^ — y Cl L L CL CL c u 


pt pptppapc 

y L U U L U U (-L VJ c 


3300 


aaap1~ppaaa 

Gl GL Cl \— * I V_4 y a. Gl 


eggctttatt 


caggct ct ac 


ttegctacat 


ttcatcgcca 


aatatcgtgc 


3360 


LaL/L-y yyoy l 


oy y y l lcil l y 


rfHcinc i p)\~c , nc* 

y uy y l- cl l uy l> 


nPiPifricccPiC! 


pppt ppt t pt 


ptt1~aappi"a 

C. C C CL CL w C CL 


3420 


L.ociy LctcL l uy 


■ht"r'r3Crr , ?5n"a1~ 

i_ 1 — v-* cl y ci y cl L- 


aatptpataa 

CL cl Ly Ly Q Laa 


pa ppa a pa pp 

CL Vwj C4. t_-L V_> 


apaptpapta 

C4. CL V^j C- v-j Ci C- Ci 


ataaaaappt 

CL L Cl CL CL CL Cl L» y L 


34 80 


u ci ci l y l. ct cl y y 


Cl^P+"P , PfP'^pTpT 

cl ci cl i — y v — r en y y 


=itara4-3=>p<=i|-4- 
CLCLCL L CLuLQ L L 


ptppttpptP 

U L y u l l y l-> l y 


ppappt a t pp 

y l> cl y y 1 — Cl L U U 


t pa t pt t pat 

l y ci l y l l l ci l 


3540 


prrrr 1 3 ;3 "P pra "h PT 
uuLaa l y ci l y 


^"t"t"crpp^'^'? : ^p , 

y L> l^ y <^ w y y ci 


PT P 1 PT P 1 1 PTa 3fTP 
y u y u l y ci ci y u 


pppaptppec 

yyyciy L.y y 


ttapptpppa 

1 — l ci y y l y u» y ci 


ptppppt aat 

u l \ws y u y l ci ci l 


3600 


4-4~;34~p , p , PTPfP 1 ; = i 
L Let LLLyyLa 


y y y ci cl cl cl cl ^ 


aciy cy LQQL L 


t rr pppt papa 

l y u l» y l y ci cl 


aataatpratpf 

ci ci c cl ci l y cl 1 y 


3 = 334-3 rrt a p 

Cl Cl Cl CL L CL y L Cl U 


3660 


r"-f- a1"ti-aat t 

L. La. L L Laca. L L 


Pr4r4l"PRi~PPTPf 
v^-> cl ci i_ \_<. ci i — y y y 


trrpaaaatpp 
l y y ct ci cl ci Ly l 


U y cl l y y Ly lcl 


aappa t ppt p 

L-j. d cj y ci l y y l U 


ptt t t a teat 

y LLLLCLLUyL 


3720 


rra r , rTP 1 P , *r"P , P"H 
y a. u y UU LLL L 


p*h crtttcrprra 

1 > — l y l u uy uya 


^~rrp\p\crcrcrp\P{p\ 
Lyciciyyycicici 


aaaapapaat 

CLCicLciy cl y ci ci l 


acpt t a ppt a 

LLUU L LCLUy LCL 


1 1 pt t pa t pp 

L LUL l y Cl l y u 


3780 


aacaaataac 


caattgccac 


aggaceggga 


aagtttattc 


tggatgaacg 


ttaaagegat 


3840 


teegtcaatg 


gataaatcaa 


aattgactga 


gaataegcta 


cagctcgcaa 


ttatcagccg 


3900 


cattaaactg 


tactatcgcc 


eggctaaatt 


agegttgeca 


cccgatcagg 


ccgcagaaaa 


3960 


attaagattt 


cgtcgtagcg 


cgaattctet 


gaegctgatt 


aacccgacac 


cctattacct 


4020 


gaeggtaaca 


gagttgaatg 


ccggaacccg 


ggttcttgaa 


aatgcattgg 


tgcctccaat 


4080 
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gggcgaaagc 


acggttaaat 


tgccttctga tgcaggaagc 


aatattactt 


accgaacaat 


4140 


aaatgattat 


ggcgcactta 


cccccaaaat 


gacgggcgta 


atggaataac 


gtcgactcta 


4200 


gaggatcccc 


qqgtaccgag 


ctcgaattca 


ctggccgtcg 


ttttacaacg 


tcgtgactgg 


4260 


gaaaaccctg 


gcgttaccca 


acttaatcgc 


cttgcagcac 


atcccccttt 


cgccagctgg 


4320 


cgtaatagcg 


aagaggcccg 


caccgatcgc ccttcccaac 


agttgcgcag 


cctgaatggc 


4380 


gaatggcgcc 


tgatgcggta 


ttttctcctt 


acgcatctgt 


gcggtatttc 


acaccgcata 


A A a n 
4 4 4 U 


tggtgcactc 


tcagtacaat 


ctgctctgat 


gccgcatagt 


taagccagcc 


ccgacacccg 


4500 


ccaacacccg 


ctgacgcgcc 


ctgacgggct 


tgtctgctcc 


cggcatccgc 


ttacagacaa 


4560 


gctgtgaccg 


tctccgggag 


ctgcatgtgt 


cagaggtttt 


caccgtcatc 


accgaaacgc 


4620 


gcg 












4623 



<210> 162 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 



<400> 162 

aagatcttaa gctaagcttg aattctctga cgctgattaa cc 42 



<210> 163 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 163 

acgtaaagca tttctagacc gcggatagta atcgtgctat c 41 



<210> 164 

<211> 5681 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pFIMD 



<400> 164 

tcaccgtcat caccgaaacg cgcgagacga aagggcctcg tgatacgcct atttttatag 60 
gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg 120 
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cgcggaaccc 


ctatttgttt 


atttttctaa 


atacattcaa 


atatgtatcc 


gctcatgaga 


180 


caataaccct 


gataaatgct 


tcaataatat 


tgaaaaagga 

Z3 ZJ ZJ 


agagtatgag 


tattcaacat 


240 


ttccgtgtcg 


cccttattcc 


cttttttgcg 


gcattttgcc 


ttcctgtttt 


tgctcaccca 


300 


gaaaccfctgg 


tgaaagtaaa 


agatgctgaa 


gatcagttgg 


gtgcacgagt 


gggttacatc 


360 


gaactggatc 


tcaacagcgg 


taagatcctt 


gagagttttc 


gccccgaaga 


acgttttcca 


420 


atgatgagca 


cttttaaagt 


tctgctatgt 


ggcqcqqtat 

_J _J ZJ Z? Z3 


tatcccgtat 


tgacgccggg 


480 


caagagcaac 


tcqqtcqccq 


catacactat 


tctcagaatg 


acttggttga 


gtactcacca 


540 


gtcacagaaa 


agcatcttac 


ggatggcatg 


acagtaagag 


aattatgcag 


tgctgccata 


600 


accatgagtg 


ataacactgc 


ggccaactta 


cttctgacaa 


cgatcggagg 


accgaaggag 


660 


ctaaccgctt 


ttttgcacaa 


catgggggat 


catgtaactc 


gccttgatcg 


ttgggaaccg 


720 


gagctgaatg 


aagccatacc 


aaacgacgag 


cgtgacacca 


cgatgcctgt 


agcaatggca 


780 


acaacgttgc 


gcaaactatt 


aactggcgaa 


ctacttactc 


tagcttcccg 


gcaacaatta 


840 


at agactgga 


tcrcracfgccrcra 


taaagttgca 


ggaccacttc 


tgcgctcggc 


ccttccggct 


900 


ggctggttta 


ttgctgataa 


atctggagcc 


qqtqaqcgtg 


ggtctcgcgg 


tatcattgca 


960 


Qcactaaacrc 


cagatggtaa 


gccctcccgt 


atcgtagtta 


tctacacgac 


ggggagtcag 


1020 


gcaactatgg 


atgaacgaaa 


tagacagatc 


gctgagatag 


gtgcctcact 


gattaagcat 


1080 


tggtaactgt 


cagaccaagt 


ttactcatat 


atactttaga 


ttgatttaaa 


acttcatttt 


1140 


taatttaaaa 


crgatctaaqt 


gaagatcctt 


tttgataatc 


tcatgaccaa 


aatcccttaa 


1200 


cgtgagtttt 


cgttccactg 


agcgtcagac 


cccgtagaaa 


agatcaaagg 


atcttcttga 


1260 


gatccttttt 


ttctgcgcgt 


aatctgctgc 


ttgcaaacaa 


aaaaaccacc 


gctaccagcg 


1320 


qtqgtttqtt 

z) zj zj -3 


tgccggatca 


agagctacca 


actctttttc 


cgaaggtaac 


tggcttcagc 


1380 


agagcgcaga 


taccaaatac 


tgtccttcta 


gtgtagccgt 


agttaggcca 


ccacttcaag 


1440 


aactctgtag 


caccgcctac 


atacctcgct 


ctgctaatcc 


tgttaccagt 


ggctgctgcc 


1500 


acrtcrqcgata 


agtcgtgtct 


taccgggttg 


gactcaagac 


gatagttacc 


qgataaqqcq 

_3 —i -J zj zj 


1560 


cagcggtcgg 


gctgaacggg 


ggqttcQtgc 


acacagccca 


gcttggagcg 


aacgacctac 


1620 


accgaactga 


gatacctaca 


gcgtgagcta 


tgagaaagcg 


ccacgcttcc 


cgaagggaga 


1680 


aaggcggaca 


ggtatccggt 


aaqcqqcaqg 


gtcggaacag 


gagagcgcac 


gagggagctt 


1740 


ccagggggaa 


acgcctggta 


tctttatagt 


cct gtcgggt 


ttcgccacct 


ctgacttgag 


1 0 n r\ 
18 U U 


cgtcgatttt 


tgtgatgctc 


gtcagggggg 


cggagcctat 


ggaaaaacgc 


cagcaacgcg 


1860 


gcctttttac 


ggttcctggc 


cttttgctgg 


ccttttgctc 


acatgttctt 


tcctgcgtta 


1920 


tcccctgatt 


ctgtggataa 


ccgtattacc 


gcctttgagt 


gagctgatac 


cgctcgccgc 


1980 


agccgaacga 


ccgagcgcag 


cgagtcagtg 


agcgaggaag 


cggaagagcg 


cccaatacgc 


2040 
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aaaccgcctc 


tccccgcgcg 


ttggccgatt' 


cattaatgea 


gctggcacga 


caggtttccc 


2100 


gactggaaag 


cgggcagtga 


gcgcaacgca 


attaatgtga 


gttagctcac 


tcattaggca 


2160 


ccccaggctt 


tacactttat 


gcttccggct 


cgtatgttgt 


gtggaattgt 


gageggataa 


2220 


caatttcaca 


caggaaacag 


ctatgaccat 


gattacgeca 


agcttgaatt 


ctctgacgct 


2280 


gattaacccg 


acaccctatt 


acctgacggt 


aacagagttg 


aatgccggaa 


cccgggttct 


2340 


tgaaaatgca 


ttggtgcctc 


caatgggcga 


aagcacggtt 


aaattgeett 


ctgatgeagg 


2400 


aagcaatatt 


acttaccgaa 


caataaatga 


ttatggcgca 


cttaccccca 


aaatgacggg 


2460 


cgtaatggaa 


taa ccrcacf qq 


ggaatttttc 


gectgaataa 


aaagaattga 


ctgccqqqqt 


2520 


gattttaagc 


cggaggaata 


atgtcatatc 


tgaatttaag 


actttaccag 


cgaaacacac 


2580 


aatgcttgca 


tattcgtaag 


catcgtttgg 


ctggtttttt 


tgtccgactc 


gttgtcgcct 


2640 


gtgcttttgc 


cgcacaggca 


cctttgtcat 


ctgccgacct 


ctattttaat 


ccgcgctttt 


2700 


tagcggatga 


tccccaggct 


gtggccgatt 


tatcgcgttt 


tgaaaatggg 


caagaattac 


2760 


cgccagggac 


gtatcgcgtc 


gatatctatt 


tgaataatgg 


ttatatggca 


acgcgtgatg 


2820 


tcacatttaa 


tacgggcgac 


agtgaacaag 


ggattgttcc 


ctgcctgaca 


cgcgcgcaac 


2880 


tcgccagtat 


ggggctgaat 


acggcttctg 


tegceggtat 


gaatctgctg 


gcggatgatg 


2940 


cctgtgtgcc 


attaaccaca 


atggtccagg 


acgctactgc 


gcatctggat 


gttggtcagc 


3000 


agcgactgaa 


cctgacgatc 


ectcaggcat 


ttatgagtaa 


tcgccfcgcgt 


ggttatattc 


3060 


ctcctgagtt 


atgggatccc 


ggtattaatg 


ceggattget 


caattataat 


ttcagcggaa 


3120 


atagtgtaca 


gaatcggatt 


gggggtaaca 


gecattatge 


atatttaaac 


ctacagagtg 


3180 


ggttaaatat 


tqgtqcqtgg 

3 -3 z) Z> ZJ Z> 


cgtttacgcg 


acaataccac 


ctggagttat 


aacagtagcg 


3240 


acagat catc 


aggtagcaaa 


aataaatggc 


agcatatcaa 


tacctggctt 


gagegagaca 


3300 


taataccgtt 


acgttcccgg 


ct era cere t era 


gtgatggtta 


tactcagggc 


gatattttcg 


3360 


atggtattaa 


ctttccrcaac 


gcacaattgg 


cctcagatga 


caatatgtta 


cccgatagtc 


3420 


aaagaggatt 


tgccccggtg 


atecaeggta 


ttgctcgtgg 


tactgeacag 


gtcactatta 


3480 


aacaaaatgg 


gtatgacatt 


tataatagta 


cggtgccacc 


ggqqcctttt 


accatcaacg 


3540 


atatctatgc 


cgcaggtaat 


aqtqqtqact 


tgcaggtaac 


gatcaaagag 


getgaeggea 


3600 


gcacgcagat 


ttttaccgta 


ccctattcgt 


cagtcccgct 


tttgcaacgt 


gaagggcata 


3660 


ctcgttattc 


cattacggca 


ggagaatacc 


gtagtggaaa 


tgcgcagcag 


gaaaaaaccc 


O 1 /.\J 


gctttttcca 


gagtacatta 


ctccacggcc 


ttccggctgg 


ctggacaata 


tatggtggaa 


3780 


cgcaactggc 


ggatcgttat 


cgtgctttta 


attteggtat 


egggaaaaac 


atgggggcac 


3840 


tgggcgctct 


gtctgtggat 


atgaegcagg 


ctaattccac 


acttcccgat 


gacagtcagc 


3900 


atgacggaca 


atcggtgcgt 


tttctctata 


acaaatcget 


caatgaatca 


ggcacgaata 


3960 
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La L LLy aLLa 
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pa *t~ a p a a p a "T* 
ya l a l a a l a L 


4020 


apa pt nrf 3 a t 


ycici uy yoLao 


aapahfrraaa 
ddLu. i— Ly ddd 


pa p a ppa ppp 
LaLciyy dLyy 


apf"+~a*h**hpap 
dy l LdL l l d y 


rrftaa ppppa 
y l LddyLLyd 


4080 


r^^ttpapppa** 


ptattapaap 
L L d L. LaL/dau 


p i +"P i n'P i t''iha'ha 

OL-LyOL. La LCL 


apaaappppp 
uLaciaLyLyy 


y d d d L L d L d d 


pt p-q ft ft pt4~ 4- =3 
LLLdLLy LLd 


414 0 


LLLdyLddL L. 


f—i r~t pr rrp pp a pa 

LyyyLyLdLd 


LLaaLaL i_y l 


a*r"4-4-pap4~pp 

dLL Lydy Lyy 


4~r*jpTppaT"paa 
L d y L L d LLdd 


a p *p "H a *r* t* ppp 
dLLLdLLyyy 




Pra a p ft a pt t a 

y a a l y d y Ldd 


t pt t p pr a t rf2 pt 

l y Luy d l y d y 
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gcgaatggcg 


cctgatgcgg 


tattttctcc 


ttacgcatct 


gtgcggtatt 


tcacaccgca 
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tatggtgcac 


tctcagtaca 


atctgctctg 


atgccgcata 


gttaagccag 


ccccgacacc 


5580 


cgccaacacc 


cgctgacgcg 


ccctgacggg 


cttgtctgct 


cccggcatcc 


gcttacagac 


5640 


aagctgtgac 


cgtctccggg 


agctgcatgt 


gtcagaggtt 


t 




5681 



<210> 165 

<2H> 40 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide 
<400> 165 

aattacgtga gcaagcttat gagaaacaaa cctttttatc 4 0 

<210> 166 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 166 

gactaaggcc tttctagatt attgataaac aaaagtcacg c 41 

<210> 167 

<211> 4637 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pFIMFGH 



<400> 167 
aaagggcctc 


gtgatacgcc 


tatttttata 


ggttaatgtc 


atgataataa tggtttctta 


60 


gacgtcaggt 


ggcacttttc 


ggggaaatgt 


gcgcggaacc 


cctatttgtt tatttttcta 


120 


aatacattca 


aatatgtatc 


cgctcatgag 


acaataaccc 


tgataaatgc ttcaataata 


180 


ttgaaaaagg 


aagagtatga 


gtattcaaca 


tttccgtgtc 


gcccttattc ccttttttgc 


240 


ggcattttgc 


cttcctgttt 


ttgctcaccc 


agaaacgctg 


gtgaaagtaa aagatgctga 


300 


agatcagttg 


ggtgcacgag 


tgggttacat 


cgaactggat 


ctcaacagcg gtaagatcct 


360 


tgagagtttt 


cgccccgaag 


aacgttttcc 


aatgatgagc 


acttttaaag ttctgctatg 


420 


tggcgcggta 


ttatcccgta 


ttgacgccgg 


gcaagagcaa 


ctcggtcgcc gcatacacta 


480 


ttctcagaat 


gacttggttg 


agtactcacc 


agtcacagaa 


aagcatctta cggatggcat 


540 


gacagtaaga 


gaattatgca 


gtgctgccat 


aaccatgagt 


gataacactg cggccaactt 


600 


acttctgaca 


acgatcggag 


gaccgaagga 


gctaaccgct 


tttttgcaca acatggggga 


660 


tcatgtaact 


cgccttgatc 


gttgggaacc 


ggagctgaat 


gaagccatac caaacgacga 


720 


gcgtgacacc 


acgatgcctg 


tagcaatggc 


aacaacgttg 


cgcaaactat taactggcga 


780 


actacttact 


ctagcttccc 


ggcaacaatt 


aatagactgg 


atggaggcgg ataaagttgc 


840 


aggaccactt 


ctgcgctcgg 


cccttccggc 


tggctggttt 


attgctgata aatctggagc 


900 



WO 01/85208 



PCT/IB01/00741 



-79- 



y uy ay l 


y y y LLLLyLu 


pf4-24-pt24-4-pfp< 
y La LLa l LyL 


^ ppa pf - pi- pf pf pt 

ay Lao l y y y y 


p r*a n"a1" crcrt* a 


aarrrlrrprr 


960 


L a L L y Lay L L 


ai4ppi4--a)pt2p i PTai 
a ll l a l a l y a 


P , PTPfprPfa^pf4-p«ai 
l y y y y a y LLa 


pfPT(^aiap4rai4-pf 
y y l» a a l a 1 — y 


cralrraapCTaa 

V— J U- V-J C_i w V^j C-A 


at"apr5parr?4't" 

t-t cj. y a u a y a l 


1020 


p^prp i 'T~pfaipfai4-2, 
L/tj^uy ciy aLa 


yy LyLLLLaL 


4ppr;a4-4-;a.2.pfP';=i 
Ly a l LaayLa 


'} _ 4"pfppt~a52pi4ppf 
l Lyy LaaoLy 


"i~P , aipfa|P<p'a3=ipr 
u. a y a *^a a y 


"t~'T""t _ 3P 1 't _ P , ai4pai 
L L La^LL«aLa 


1080 


La LaL l l Lay 


aLLyaLLLaa 


a5a»p , 4 - 4-p«pi4p4r4p 

aaLLLL-aLLL 


l l a a l l l a a a 


Pi ncspi 1" pi - r an 
y ^ y y 


i"Pr5r9rf3i"PP"r" 
1^ y a a. y a v— \ . k ■ i_ 


1140 


lll uy d Lad L 


L LLa LyaLLa 


2224-p«p<p«4-4-a» 
a a a llll l l a 


2 p< pf 4p pr ai pf 4- 4p 4p 
a l y l y a y l l l 


i~prTi"1~pPr?pi" 


pr p\ rrPPT'h'P'aspfai 
y a y w y i_ w a y a 


1200 


LLLLy l ay aa 


adya L La. a. ay 


p3 "h pi — Hp'4 — f~pf 

yaLLLLLLLy 


2 prai4-p'p'4p4p4p4- 

ayaLLLLLLL 


-j — t — f— p , 4~prp , pTp , pr 

L L LL Lywy uy 


4-aia>4-p«4ppTp , 4-pr 
l a a l l> LyL Ly 


12 60 


!~i *t J~ /T /"-I 2 3 ^ pt 2 

L L L y UaaaUa 


aaaaaaLLaL 


r~* m~i-\- 2 p'P'a pr p 1 
LyL LaLLayL 


pfpf4~ PTPf"t _ 4 - "f _ pf4" 
y y Ly y l l Ly l 


■|~ "h rrppfrrra "hp 

l Ly Lt>y y a l l. 


plPiC^Pl^^cf~P\^ t ^ , 
aayay 0 LaLL 


1320 


aaLLLLLLL L 


pi pi Pf 2 2 Pf Pf 4" 3 3 

L L y a a y y L a a 


p 1 4~ rT pf p< "h 4~ p> a pf 
LLyyLLLLay 


r~> 2 pr 2 pfP'Pfp'ai pf 

Lay ay LyLay 


3i~aip«p»aiajaa4p=j 
a LaLLaaa La 


L Ly LL>v_*LLLL 


1380 


aipr4~pr4-aiprp , p , pr 

cay l y Lay ullj 


4- 3 rr"l — P ai pt pr p* P 1 

Lay l LayyLL 


a l l a l l LLaa 


Pf 2 2 p»4p p«4p pf4p aj 
yaaLLLLy La 


pr p< ai P" p« pf p 1 p 1 1" P 
y l* a 0 l< y u u l a 


nafa pp1~ r*np 
l- a l a v_y l< i_ l< y 0 


14 4 0 


LL LyULaaLL 


L L. y L LaLLaLj 


4r /~f /~f p 1 4~ PTp , i - pfP i 

LyyLLyLLLJL 


c~> 2 pf4p pr pf p 1 pr ai 4p 
Lay LyLjLyaL 


ajnpf4ppipf4-pT"t - p 1 
aay Luy l y lo 


4-4-2 p , p , prpfpf4 _ 4~ 
l LaLLy yy l l 


1500 


PTPf 2 f* 4~ P"« 2 2 PT ai 

yya.OLOcaa.yci 


pi pr 2 4- ai pt4~ 4- 2 P 1 
LydLdy LLaL 


p 1 pf pf ^ 4- 0 2 pfpfp' 
Lyy a l a a y y l 


npp pf p< pfpf 4~ p< pr 
y Lay Lyy l Ly 


Pf PT P 1 +" PT Pi Pi P 1 Pf Pf 

y y l Ly aauyy 


Pfpfprpf4p4pp'pf4~pr 


X J LIU 


pt pi — i v~i «a rr r~< r~* r" 1 
OaL.aL.aLjL/LL- 


-3 rrp'l - 4 _ prpraiprp i 
a y l L L y y a y l 


rra a p , pt3 P 1 p 1 4" ^ 
yaaLyciLL l a 


f~< 2 pprraa p^ 4" pr 

LaLLyaaL l y 


anafsppfap 
aya l a ^ — ^ — l cl l- 


^PTP , pr'l - PT3prp , 'r" 
ay Ly LyayLL 


1620 


aLLjaLJaaaLJL 


rrp P 1 ai Prfp4"4r P" 
y L La L y L L LL 


P'P'Pf^l 3 PfPfPf^ PT 

LLyaayyyay 


222 pr pfp'Pfpra} p 1 
aaayy LyyaL 


a> pr pf 4- ai 4~ p prrrr 

a y y l a l l- L»y y 


taarfrrrrfrarf 

l a a y l> y y l< a y 


1680 


pf pf 4~ pi rr rr 2 2 p"« as 

yy LLyyaaLa 


ptpt 2 t~f 2 f~rfi pf pi a 

yyayayLyLa 


pn~a pt pf pf p> pfp 1 '} - 
LyayyyayLL 


4~ p~> p~< 2 pr pf pf pr pt ai 
LLLayyyyya 


2 ai n ptp' P 1 "H pr pri - 
aaLLjLLLyy L 


aLL» l l l a l a y 


17 40 


lll l y LLyyy 


L L LLyLLaLL 


4- p< 4~ pr ;a p , 4 _ 4~prai 
ll l y a l l l y a 


PTP<pf4 - p , pfai4-4-4- 
y Ly LLyaLLL 


i~1 - pf1 - n'^"hpp1~ 

L l y l y a l y i_ 


p 1 pr 4~ p 1 ai prpf pt pf pt 
Ly LLayyyyy 


1800 


r~> r~r /~t 2 /-r /~> /~> 4~~ 2 

yLyyayLL La 


4- pr pr 2 3 2 2 2 i^~»/T 

LL| y aaaaaLy 


LLayLaaLy L 


prprp^p^4-4-4-4-4- 2 
yyLLL L L LLa 


fTrn f" 4p p^ pi 4- prpf 
Lyy l lll Lyy 


r -t^4-4-4-4-pf»-t4-pT 

L< L- L LLLyLLy 


18 60 


ptr pi pi 4— 4— 4— 4— /~T /— i 4— 

yLLL L L LyLL 


papai"n'4"4"p4" 
La. La Ly L LLL 


4~4 _ p , p , 4 - pfp i pf4~4~ 

LLLLLyLy LL 


at'pppp'hpa't" 

a LLLLLLyaL 


"t~p , +"pr _ t~pfpc^'h"3 
LULy LyyaLa 


a5p , p , Pff"a)4~1-aap' 
aLLy La l LaL 


1920 


p , pfp , p*4p*l~"i~pfaiPT 
LyLL L L l y a y 


4~" pr ai prp , 4 - pt ai 4p 2 
LyayLLyaua 


p , p i ptp , +" p'pfp'p'pr 
L l y L L Ly L l y 


pi 2 pfp'p'pfaia) p"Pf 
\ — ay L-Ly aaoy 


a> p i p , pr^ PTP'PfP'Pi 
a 1 — ■ uy uy <^ y v^-a. 


firnanlrRrfi" 

l< y ayj i— ua.y i_ 


1980 


praipfp'PraipfPfaiai 
yay Ly ayyaa 


PTP 1 Pf Pfaj a prai pr p> 

yLyyaayayL 


PfP , P , P , 3a)4pa5p'pr 
y l l l a a LaLy 


L<aaav_royL/L« l 


rtTrrrarar' 
w u ^ y y 


crtt crcrppaa t" 

y ULyyuuyaL 


2040 


4- < ra 4- 4- a "h rrr 1 
LLa l L a a lljl 


2 pr r~> 4~ pf pf P" ai p , pf 
a y l Lyy LaLy 


2 pp pr pf 4~ 4~ 4~ p" p> 
aLayy l l lll 


p< pf 2 p>-hpfpraia5ai 
l y a l l y y a a a 


rrrnnrfpri pf 4~ pr 

y l> v-j y y \s a y l y 


a y L^y v_>a. a Ly l 


2100 


aa L Lad. L y L y 


2 pt4 — \~ a pp4- na 
dyLLayCLLa 


p , 4 - r^a54p4-2prpfp" 
LLLaLLayyL 


2 p , p | p<p , aipfpfp"4 - 
aLLLLayyL l 


4-4-2pi2p<4-4p4-2 
L LaLaLL L La 


i^PfP^I" f~ P^PTTPTP' 
LyL L LLLyyL 


91 fin 

J_ U L 1 


4- f~> pr"f" sfrfftrr 
L L y La lu l Ly 


■fp pt4~ pr pr 3 2 4~ 4~ Pf 
Ly l y y a a l Ly 


4p pf ai pr p« pf pf a> 4- aa 
l y a y Lyy a La 


aiP i aiai4p4--{-p»ajp' 
a l a a l l LLaL 


PiCPinCTPiPiPiHPi 
a ua y y a a a 1 — ' a 


p p t~ a t" era p p 3 


2220 


Ly a l l a l y l l 


2 21 rr p< 4 — (- a "h pt ai 
aay LL LaLya 


pfa>aiaip , aia)aiP i P 1 
yaaaLaaaLL 


J_ -4-4- 
LLLLLaLLLL 


r~ , 4-pf4-pfptpfp'4-4- 

L Ly LyLyLLL 


1~'ri""t"pi~ppp1~ 

LLLLy Lyy l l 


22 8 0 


/-f P" Pf Pr 4~~ Pf 2 pr 4~~ 

gyLyy l y ay l 


/—1 23 /—1 f~c /~t 4— 4— 4— rvrr 

LaLycLL Lyg 


p^ 4~ /TP prpr 2 4~ 2 pr 

LLyLyyaLay 


pi2p , pfai4~4~aip , 4 _ 
LaLLja L LaL L 


21 4~ pip i prp^pfpfp , 4 _ 

a LLLLjLyyLL 


2 4- pt4~ p 1 ai pr pr pr ai 
a l y LLayyya 




4- 2 ^ P 1 pf pf P 1 4~ Pf 4~ 

LdaLyyLLy l 


2 pr 4- ry 4- pfprp'p'pr 

a y l y LyyLLy 


pi4^ pfa^a^4 - p , alalp , 
l l y a a l l a a l 


l a a L L L l a L L 


pp}~"f - pfai'r~p , "hpra) 
y l Ly a ll Ly a 


f~ncTPP\P\Psr^rsc* 

Lyy aaa.cj.uy u 


2 4 00 


gyLyaayLaa 


4~ 4~ 4— 2 2 /~« 2 2 p< 2 

L L LaaLaaLa 


4- 4~ pf pf p 1 pf p* pt ai p 1 
l Lyyoy LydL 


pf 2 pt4-pip^4~pf4~4~ 
yaL lll Ly ll 


Pf4-4pp i p«a54-4-4ppi 
y LLLLaLLLL 


yLaLLLLyLL 


9 4 60 


/-«-4— p« as prr<-)- pt 4- 
QLLaLLL Ly L 


yyLaaLgccy 


4 - 4 - 4- pt 4~ ppppf" 
LLLLLyCLy L 


2 2 2 prpf 4~ 4~ pr pr pf 
aaayy L L y y y 


H — 1 — I— 2 p< 4~ Pf Pf P" Pf 

l l LaL Ly y Ly 


^^CfCPinPi^PirT 
L L y l a y a Lay 


9^90 
z, w 


ccacaatgcc 


aacctgettg 
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aaaatcaaat 


accccttaat 
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ccgcgctttc 


2640 


gtggacgacc 


ctgacgccgg 


gtaaaccaaa 


tacgetgaat 


ttttacgccc 


ggctaatggc 


2700 


gacacaggtg 
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2760 


tcagtaactg 


gagatgetea 
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caaacgtggg 


tatgtattgg 


eggcaatatt 
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C4. L_ ^— ' 


a t c a c crcT t era 


a ppTprf- aacrPT'h 

n l- y y i_ f_j. t— l y y l 


2880 


L^y LOy 


p n pf Pf "h p\ can 
l l y l y l a l y y 


L L LLLaLLaL 


p^alrfrrarfr 

l< a a l y w w a < y 


rri~i-n'at"pt"r , rf 

y l Ly a 1— • v_» i_ ^ y 


yLyaLLL l La 


2940 


^ — v_ Luay L 


LL.Ld.Ly LL Ly 


LLyyyyLyyL 


a+"P , PfPTP i r , 1 - PTPT 
a LLyyLLLyy 


p'a'TPTaf'PT'h'hpr 
La l y a l y l Ly 


P i PTP , "h"t~PTapff~f" 
LyLLLyay ll 


3000 


rrapfaaffpf 


LLy y l y y y a a. 


ppf p pa pppf 
Ly LLyayyy l 


r^ar , i~PTP , p , PiPTP 1 
l a l Ly li — a y l 


■f 4- /-< a PTP 1 PTPTPTPf 

l LLayLyyyy 


parrpprrs parr 
Lay l l y a l a y 


3060 


L-a^^yy a La L 


L CL LCtCLCLClClLL- 


PI prprpfPTa C C'T^C 
cty yyyaLLy l 


pipaaaapafp 
y l a a a a l a ll 


p> a pr -f- 1~ a pf a pTp- 1 
La y i— 1 a y ay l 


fa pa rfpra f pa 
LuLayya Lya 


3120 


n"i" nrf p p 
Lay Ly L|Oaa(j 


a L a L L y CL CL La 


pfppp>ppaa (— • 

yyy 


LaaaaLay l l 


pa pt pt f" pr pr a t" rr 
uay y Ly y a. Ly 


a L LLL LLaLQ 


318 0 


a f nnrfprrpa p 


ffppppffap 

L LLLLy L LaL 


P? PPf C Pi CfP pp 
ay y li — a y a y l 


Pi t* t" pya parrfa 
a l LyaLay La 


Pi Pi t" PTpf p PTpra pr 

a a Ly yuy u a cj 


PP^pf PPfTprf 
l- cl LLuy y y 


324 0 


aauua L Loay 


PPapfpaffa 

y l a y Ly a l La 


ppafpappfa 
y L a LLaLLLa 


f a ppfa pa hp 
l a l l LaLay l 


trfaappp pra a 
LyaaLLLyaa 


pfaPTaf"pra'r~f"PT 
yayaLyaL Ly 


3300 


faafpaaapp 


.1_.1_ .1_j_ 
ay l l a l l a l l 


pfpfffppfp 
L l y L L LyLLy 


+■ a P t "rpTP , T"pfa'H 
l a l l y l l y a l 


PfPTPTP'f'PTPT'r- P^PT 

yyy° L yy L ^y 


prf- a a a f pppf 

y L CL CX CL L y L L L 


3360 


pr /-y 4— /—i -3 4— 4— /-t /-<r <— i 

yy LCaLT-cyc 


L CLJ LdddaLL 


pp p -_ -_ f pprf _a 
yLLaaLyy La 


L L y L La LLLL 


4- —1 4- 4— j— f rrpprrf 

La l Lyy Lyy l 


pTpfpa pfppfppa 
y y Lay Ly LLa 


O *± Z, L' 


a 1" rrf f f a f pf 
a. Ly L L L Cl Ly L 


a a a y—> /—1 H — t— pp p 
CL Cl CL L» L. l Ly Ly 


p p p pf ppf pa 
LLLy l l y l y a 


a "}~pf"hpTPTPTPTP i a 
a l y l y y y y l a 


aaappfppf p 
aaaLLLyy ll 


prf" nrraf pf f f 
y LyyaLLLLL 


J ^ 0 u 


oy auy Uaact L 


/—i 4- 4- 4- 4— prp % f~ 1 a f 
LLLLiyLLaL 


ciaLya l La ll 


ppfpa aa npaf 

LyyaaaLLa l 


l a l a y a l La l 


prfpapapfpp 
y LLaLaL Ly l 


^ S4 0 


3 3prran , rrp'hp 
ciauy ay y l. l 0 


yyLL La l y y l 


PTPTP 1 PT+" PT"! - fat" 

y y ^y L y t - Lci i - 


LLaaLLLLLL 


p 1 pfpTPTa p> p^ pr "h a 
l y y y a l l y La 


aaafafaprfp 
aaa La Lay Ly 


3600 


pp a pf a ppf a 
y uciy Lay l» l a 


f ppafffppf 

L LLCl L L LLLL 


pir , p i a p 1 1" 1 a pt p" pt 
aLLaLLay l y 


aaaprfpprrprr 

a a a \_- y v — v_»y y 


p pf-r +■ rrf* t" f* a t~ 

L-y (_L-V_J L-CL CL l_ 


3r)f f prrarraa 

CL CL l_ 1 — L< y CL ^ CL CL 


3660 


L-y y a uaay 


pf /-rrrr~ i ( , ~'PTrT't~ P 
y Lyy LLyy Ly 


pTP , prp , T~T"+"a'hi~ 
y l y l l l l a l l 


4- rra en cc^ ctf~ 
LyaLyLLLy l 


pfa PTPa PTT - PT P 1 PT 

y a y l a y l_ y l y 


PTPTPrrnrrrprfprr 
y y Ly y y y uyy 


3720 


r*TTPt~h1~PiPtP)(Tr~' 

oy a l Laaay 


\~ n cx )r p f f a 
Lyy l LLa l La 


f rTfTTtY Cft~* 
CL L LyLLy l y L 


...... 

LLaLLLLyLy 


a pa pra p pa a p 
a La y a l> l» a a l 


aapf/^fp)apa 
a cl i_ cl i_ cl cl l- a 


3780 


pppafpafff 
y^y a LyaL l l 


ppp pf f f pf P 

LLay LLLy Ly 


fprf a a i" a f "h t" 
LyyaaLaL l l 


ptcrrrT'PiptpiPi 
a l y LLaa Laa 


■f rra +" ptt - prprf" pt 
Lya Ly Lyy Ly 


pfptppfapfp 
y LyLLLaLLy 


384 0 


yuyy l Ly L-y a 


fpfffpfppf 
Ly LLLLLyLL 


ppf pa f pt"T P 1 a 
Ly l y a l y LLa 


pprr+"hap'l"pf" 

L L y L L a L L L L 


rrpppfpfa ptap 
y l l y y a l l a l 


p p f pprf f p a rr 
ll Lyy l LLay 


_) — » \j \j 


+"pppa a f f p p 

l_y LLaCt L L L L 


"hpf'happrf'h'l" 

L L L LaLLy L L 


4--a •f~+-pT"hPTP , PTa 
l a l Ly l y l y a 


aaaappaaaa 
a a a y l l a a a a 


P 1 P* 1 f" PTPTPTPTT" a f* 

ll Ly y yy l a l 


fappfpfpppr 

LaLL LL LLLy 


39 60 


pr p> a <— i -a ra r~> r~* n (~* 

y LauaauLy l 


ayaLyLyyyL 


aa P i "t - P" , PTa"T"t~'J - 
adLLLyaLLL 


"hp , ap , » _i aa"t - ap i 
LLaLLaa l a l 


PPPpf <— ^ pr 4- 4- 4- 

l y l y l l y l ll 


f p«a p- p f p p> a p 
L l a L L Ly l a L 


4 n 9 n 


a nTTrr/TT+" turret 
ciy y y Ly LLyy 


pn^a rfl — hp 
Ly LaLay l Ly 


a /~ < rrp 1 ptp^ a a p 1 pf 
QLyLyLaaLy 


prfa p 1 pt a "h +" a I - 
y LciLya l lcll 


fppapfppaaf 
LLLay Ly aa L 


aa pa pppfaf 
a a l a l y y l a l 


4 08 0 


Ly l Lay y ay l 


a pf a rrrrrspt p f 
ay LayyyaLL 


"T P i PTPTP^PTPft~PT3 

L^yy Ly y L.y a 


pTT _ P' , 1 - pfpfpra"i-'r 
y LLLyyyaLL 


aaLyyLaaa l 


f af p/paprrf a 

La c y v — ^ cl v_- y c cl 


414 0 




ppf pa p"h ppa 
yy Ly aL Ly La 


PTPTPraa+"PTT"PfP^ 
y y y a a l y l y l 


aafpaaffaf 
aa l l y a l La l 


•f- ptptp'pt'T - pra p t i~ 
l y y l y l y a l l 


f f f af ftair 

LLLy LLLaLL 


4 200 


aafaafpfap 

Ct Cl ' — CL CL 1 L» 1 CL y 


ayya LLLLLy 


prpf +- 0 pprra esc* 
yy LaLLyayL 


"r"prfaa1"r"pap 
i_. l y a a i_ 1 — a l 


T"ptptp , p , pt1~ pprf" 
i_. y y l Ly LLry l 


f f t" r paapnt" 

' C C CL V-» CL Cl \_» y c 


4260 


p pf pa r~» f~ rTfTfT 
L y Ly ao Ly yy 


a a a a pppf* pp 

a a a a l l l Lyy 


r— irr'f _ 'f _ ap >, p , 'p , aa 
Ly l l a l l l a a 


pf f a a f~P >, PTP >, P* 

l l Laa L Ly L L 


4- 4- py pf — 1 prpt 13 p< 3 

l Ly Lay LaLa 


LLLLLL C L LL 


4320 


/-r^o^ py p t~ pf pf p 
ULLCiyLL.yyL 


pf a a+-a rrrTra 
y Laa Lay Lyct 


ayayyLLLyL 


a l l y a li — y l l 


pi4-4-pppra3p=s 
L L LLLLaaLa 


pf fpTpcrcapfp 

y L c y L-y cay l- 


4380 


ctgaatggcg 


aatggcgcct 


gatgeggtat 


tttctcctta 


cgcatctgtg 


eggtatttea 


4440 


caccgcatat 


ggtgcactct 


cagtacaatc 


tgctctgatg 


ccgcatagtt 


aagccagccc 


4500 


cgacacccgc 


caacacccgc 


tgacgcgccc 


tgaegggett 


gtctgctccc 


ggcatccgct 


4560 


tacagacaag 


ctgtgaccgt 


ctccgggagc 


tgcatgtgtc 


agaggttttc 


accgtcatca 


4620 


ccgaaacgcg 


cgagacg 










4637 
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<210> 168 

<211> 9299 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pFIMAICDFGH 



<4UU> loo 

cgagacgaaa 


gggcctcgtg 


atacgcctat 


ttttataggt 


taatgtcatg 


ataataatgg 


60 


tttcttagac 


gtcaggtggc 


acttttcggg 


gaaatgtgcg 


cggaacccct 


atttgtttat 


120 


ttttctaaat 


acattcaaat 


atgtatccgc 


tcatgagaca 


ataaccctga 


taaatgcttc 


180 


aataatattg 


aaaaaggaag 


agtatgagta 


ttcaacattt 


ccgtgtcgcc 


cttattccct 


240 


tttttgcggc 


attttgcctt 


cctgtttttg 


ctcacccaga 


aacgctggtg 


aaagtaaaag 


300 


atgctgaaga 


tcagttgggt 


gcacgagtgg 


gttacatcga 


actggatctc 


aacagcggta 


360 


agatccttga 


gagttttcgc 


cccgaagaac 


gttttccaat 


gatgagcact 


tttaaagttc 


420 


tgctatgtgg 


cgcggtatta 


tcccgtattg 


acgccgggca 


agagcaactc 


ggtcgccgca 


480 


tacactattc 


tcagaatgac 


ttggttgagt 


actcaccagt 


cacagaaaag 


catcttacgg 


540 


atggcatgac 


agtaagagaa 


ttatgcagtg 


ctgccataac 


catgagtgat 


aacactgcgg 


600 


ccaacttact 


tctgacaacg 


atcggaggac 


cgaaggagct 


aaccgctttt 


ttgcacaaca 


660 


tgggggatca 


tgtaactcgc 


cttgatcgtt 


gggaaccgga 


gctgaatgaa 


gccataccaa 


720 


acgacgagcg 


tgacaccacg 


atgcctgtag 


caatggcaac 


aacgttgcgc 


aaactattaa 


780 


ctggcgaact 


acttactcta 


gcttcccggc 


aacaattaat 


agactggatg 


gaggcggata 


840 


aagttgcagg 


accacttctg 


cgctcggccc 


ttccggctgg 


ctggtttatt 


gctgataaat 


900 


ctggagccgg 


tgagcgtggg 


tctcgcggta 


tcattgcagc 


actggggcca 


gatggtaagc 


960 


cctcccgtat 


cgtagttatc 


tacacgacgg 


ggagtcaggc 


aactatggat 


gaacgaaata 


1020 


gacagatcgc 


tgagataggt 


gcctcactga 


ttaagcattg 


gtaactgtca 


gaccaagttt 


1080 


actcatatat 


actttagatt 


gatttaaaac 


ttcattttta 


atttaaaagg 


atctaggtga 


1140 


agatcctttt 


tgataatctc 


atgaccaaaa 


tcccttaacg 


tgagttttcg 


ttccactgag 


1200 


cgtcagaccc 


cgtagaaaag 


atcaaaggat 


cttcttgaga 


tccttttttt 


ctgcgcgtaa 


1260 


tctgctgctt 


gcaaacaaaa 


aaaccaccgc 


taccagcggt 


ggtttgtttg 


ccggatcaag 


1320 


agctaccaac 


tctttttccg 


aaggtaactg 


gcttcagcag 


agcgcagata 


ccaaatactg 


1380 


tccttctagt 


gtagccgtag 


ttaggccacc 


acttcaagaa 


ctctgtagca 


ccgcctacat 


1440 


acctcgctct 


gctaatcctg 


ttaccagtgg 


ctgctgccag 


tggcgataag 


tcgtgtctta 


1500 
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ccgggttgga 


ctcaagacga 


tagttaccgg 


-82- 
ataaggcgca 


gcggtcgggc 


tgaacggggg 
— * ~> p p p p 


1560 


gttcgtgcac 


acagcccagc 


ttggagcgaa 


cgacctacac 


cgaactgaga 


tacctacagc 


1620 


gtgagctatg 


agaaagcgcc 


acgcttcccg 


aagggagaaa 


ggcggacagg 


tatccggtaa 


1680 


gcggcagggt 

— ' —1 ~) P P P 


cggaacagga 


gagcgcacga 


gggagcttcc 


agggggaaac 


gcctggtatc 


1740 


tttatagtcc 


tgtcgggttt 


cgccacctct 


gacttgagcg 


tcgatttttg 


tgatgctcgt 


1800 


cagaQQQqccf 

PZJPPPP Z) 


gagcctatgg 

ZJ z) P P 


aaaaacgcca 


gcaacgcggc 


ctttttacgg 


ttcctggcct 


1860 


tttgctggcc 


ttttgctcac 


atgttctttc 


ctgcgttatc 


ccctgattct 


gtggataacc 


1920 


gtattaccgc 


ctttgagtga 


gctgataccg 


ctcgccgcag 


ccgaacgacc 


gagcgcagcg 


1980 


acrt caataacr 


cgaqgaaqcq 


aaaqaqccfcc 


caatacgcaa 


accgcctctc 


cccgcgcgtt 


2040 


ggccgattca 


ttaatgcagc 


tggcacga.ca 


ggtttcccga 


ctggaaagcg 


ggcagtgagc 


2100 


gcaacgcaat 


taatgtgagt 


tagctcactc 


attaggcacc 


ccaggcttta 


cactttatgc 


2160 


ttccggctcg 


tatgttgtgt 


ggaattgtga 


gcggataaca 


atttcacaca 


ggaaacagct 


2220 


atgaccatga 


ttacgccaag 


cttataatag 


aaatagtttt 


ttgaaaggaa 


agcagcatga 


2280 


aaattaaaac 


tctggcaatc 


gttgttctgt 


cggctctgtc 


cctcagttct 


acagcggctc 


2340 


tggccgctgc 


cacgacggtt 


aatgqtqqqa 

P P J 3 3 


ccgttcactt 


taaaggggaa 


gttgttaacg 


2400 


ccgcttgcgc 


agttgatgca 


ggctctgttg 


atcaaaccgt 


tcagttagga 


caggttcgta 


2460 


ccgcatcgct 

ZJ 


ggcacaggaa 

P ZJ J P 


ggagcaacca 


gttctgctgt 


cggttttaac 


attcagctga 


2520 


atgattgcga 


taccaatgtt 


gcatctaaag 


ccgctgttgc 


ctttttaggt 


acggcgattg 


2580 


atgcgggtca 


taccaacgtt 


ctggctctgc 


agagttcagc 


tgcgggtagc 


gcaacaaacg 


2640 


ttggtgtgca 


gatcctggac 


aqaacqqqtq 

Z) J J J ZJ 


ctgcgctgac 


gctggatggt 


gcgacattta 


2700 


gttcagaaac 


aaccctgaat 


aacggaacca 


ataccattcc 


gttccaggcg 


cgttattttg 


2760 


caaccggggc 


cgcaaccccg 


ggtgctgcta 


atgcggatgc 


gaccttcaag 


gttcagtatc 


2820 


aataacctac 


ccaqgttcag 


ggacgtcatt 


acgggcaggg 
p p p p p p 


atgcccaccc 


ttgtgcgata 


2880 


aaaataacga 


tgaaaaggaa 


gagattattt 


ctattagcgt 


cgttgctgcc 


aatgtttgct 


2940 


ctggccggaa 


ataaatggaa 


taccacgttg 


cccggcggaa 


atatgcaatt 


tcagggcgtc 


3000 


attattgcgg 


aaacttgccg 


gattgaagcc 


ggtgataaac 


aaatgacggt 


caatatgggg 


3060 


caaatcagca 


gtaaccggtt 


tcatgcggtt 


ggggaagata 


qcqcaccqqt 

^3 :3 p p 


gccttttgtt 


3120 


attcatttac 


qqqaatqtaq 

ZJ P P P P 


caccfqtaata 


agtgaacgtg 


tagqtqtggc 

ZJ P P ZJ P 


gtttcacggt 


3180 


gtcgcggatg 


gtaaaaatcc 


ggatgtgctt 


tccgtgggag 


aggggccagg 


gat agccacc 


Jz 4 U 


aatattggcg 


tagcgttgtt 


tgatgatgaa 


ggaaacctcg 


taccgattaa 


tcgtcctcca 


3300 


gcaaactgga 


aacggcttta 


ttcaggctct 


acttcgctac 


atttcatcgc 


caaatatcgt 


3360 


gctaccgggc 


gtcgggttac 


tggcggcatc 


gccaatgccc 


aggcctggtt 


ctctttaacc 


3420 
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tstparrtaat 
i— gl i_.t_.ciy lclq. l 


trrttP^aPaa 
L y L Loa.yv_.ciy 


ataatataat 
a L. a a l y l y a l 


aa papfpraapa 
a a v_. a y y cj. cj. \_. a. 


craacacrt era a 


taataaaaap 

^— Ci. C Ci Ci Ci Ci Ci Vw> 


3480 


rrtpaatpTtaa 
y l octa i— y Laa 


y y ct cl ci Loyoci 


y y a a a l a a o a 


1 1 pt apt tap 

<— i_ v_- i_ y ^ i_ i_ y \_< 


t aa caacrt at 


cctaatattc 

^— * C- Ci C- y C- V— ■ c^ 


3540 


3 t rrpr p a a 4- p* -a 
cl Lyy _aa Lya 


trrrrttrrpprfrr 
l y y l l y o o y y 


P C H C Cl cY Cl P P 
a*_*yoyoLyaa 


n'prTrTrfacf't'crn' 


ccttaaatac 


aact papata 

C4. C V- v-i C-/ Ci C- Ci- 


3600 


^ttt^tppprrr 
a. l i__.ciL.ooyy 


p a rrrrprraaaa 
octyyyoctctcLci 


PCPPCTPCJCPP 
a \_> a a y a y a a. 


p1"i - n'r , ncrL.cra. 

L- 1— "^-^ ^ w y 


caaat aat ga 


taaaaataat 

C- C*L Ci Ci CA C- C-*. C 


3660 


ClV_.V_.LCt L L Ldct 


f 4r pa a f pa i" rr 

L LOCtCtLOCtLy 


rrrrt rrrr a aaat" 
y y luu a a a a i- 


V-J V^J C-A ^ ^ y 


taaaggatgg 


t cgt t t tat c 


3720 


rrt pra rr f" 1 (~* "H p 
y uy auy ul ll 


pi~pi"rft~i"1"pr' 

OLOLy LLLyo 


CfP,]~CXPtPCfCfCfPi 
y a i_ y a a uyy cj. 


a. a. a. a. <— i y ^ 


at acctt a eg 


tattcttgat 


3780 


y O a a Oa a a LCI 


a.pp , aattrrr*r' 

CL O O CL Cl L LyOO 


a *_• a y y a v_» v_. *_j y 


aaaacrtt tat 


t ct ggat gaa 


cgttaaagcg 


3840 


a4 - -f-pppT-ppaa 

Ct L LULy L Oct CL 


tpfPT^taaia^pi 

LyyciLcictctLO 


aaaa l l. y a o l 


rracraataccrp 

v_j d y C i _. < — i ^ 


tacagctcgc 


aat tat cage 


3900 


< — i /-<r /— i ai 4- 4- n a a P 

oy oa L. LdaaL 


4- pr 4- a p4- a 4~ P Pr 
L y LCLOLCtLOy 


PPPPTPTptaaa 
oooyyo l a a a 


L Layoy l Lyo 


a \_- v_. v-*y a ' — v_- a 


y y y c^. \_j cj. cj. 


3960 


ia ra -n 4- 4- 73 ia /-y 11 4— 
ClclclLLclciyclL 


4-4- p pt 4- p p 4~ 3 pt 

l coy Log Lciy 


ppfppra a1"Hpt" 
oy oy a a l lol. 


pi 4- pr a prfptrra 
o Lyaoyo Lya 


ttaappipprap 
l Laao*_>oyao 


-3ppp4-a-f-4-i3p 
a v_- >_ v_- i_ a i_ i— . a v_> 


4020 


p 4~ it m /^i rv/~x 4- ra -a 

LLyacygLaa 


pa rra rff - 4/ rfa a 

o a y ct y l l y ct ct 


4- prp pprpf a a p p 
Lyooyyaaoo 


carreer t P 1 1 1 rr 
oyyy l l-ol Ly 


aaaa4-pTpa4-4- 
a a a a i_yoa l 


aatapptppa 

y y 1 — y v_- \_. c > — > cj. 


4080 


ct Lyyyoyctact 


rfpa pi prpr 4- 4- a a 

yoctoyy l l ct ct 


a t"hrrppl"t"p1" 

dLLyOOLLOL 


rr a +" rrp r rrriR 3 
y a i_ y v_. ay y a. a 


rrpaatattap 


ttarraaaca 

C, C- Ci V_>- Ci Ci C*' CA 


4140 


-a 4- a n a |- rf a 1" "h 
ClLCLClClL.yciLL 


-a 4- pf prp«pf p> i3 p 4p 

ctLyyoyoctoL 


"papppppaaa 
Laoooooaaa 


^trr a prfffrfpp 
a Lyaoyyyoy 


i_- o_ Ci k_ y v-j ci. c_i c^ ci 


av^yv_-ayyyyy 


4200 


ia a 4- ■)- 4- 4- 4- prfp 
CiciLLLLLOyO 


O L y ctct Lctctctct 


apraa+"'TrTap"} _ 
a y a a l i_y a o l 


rrrTTirTPJCft Cf3 
y ^ y y y y *- y a 


ttttaaacca 

C C C- t_i Ci V-j v__* v_-* c^ 


9^aggaataat 


4260 


y LLa La L O L y 


aatttaip/~r-ap 

CtCt L L L Ct Cl y CL O 


1 1 1 = pp a cfCCI 
i i < — a o a y v_, 


Ci Ci C*. Vwl. L_J_ t-i 


t get tgcata 


t t cgt aagca 


4320 


4— ppr'Htto'prpt 

l o y LLLyyoL. 


rrrrt 1 1 1 1 1 1 a 

yy LLLLLL Ly 


f~ pppra p "T p rr t 
Looyao Loy l 


t rrt rcrrpt" crt 


Cfcttttcfcca 

^ Vw-» C- C- C v_> 


cacaggcacc 


4380 


(_ L L y L OCl L O L 


npp pt a npi-pt" 
yooyctoo lo l 


atttt^r^tPP 
aLi-Li_acii_v_'V_. 


rrnrrpttttta 


geggatgate 


cccaggctgt 


4440 


y y o oy a l l l ct 


4- r~ , pfpipr4-4-4p4-pf 

Loyoy LLLLy 


aaa=i4-pfprpfpa 
a a a a Lyyyoa 


ay aa l i_ai_«oy 


rracracfs pat 

v_- a. y y y a. v_- v_j i_ 


a t pap at caa 

CJ. c- c_- w c^ c^ C C-* v^J CJ. 


4500 


LctLOLctLLLy 


a ataa4-/-Y/-r4-4- 
ctct l ci ci l y y l l 


p^^^Pli^cTc^c^ , p\PlC , 
cl La Lyyoaao 


pr p n't pt 3 1 rrt P 

y \_- y i_ya(_y «_ \_> 


apatttaata 

Ci Ci C C C Ci Ci C- C-i 


egggegacag 


4560 


4-rv_3 a pa a rfrrrf 

uyaaociayyy 


a^ptpl^tpppt 
Ct L Ly LLOOOL 


npftrra p a p pr 
yoo Lyaoaoy 


prrrrrP 3 3 rt" p 

uu lu v_a a v_» i_ 


nrra at = t aa 

V_j Cv Ci Ci C* ^ ^ 


ggct gaatac 


4620 


."r /~f pi 4- 4— p 4- pr 4- pi 

ycjciL_Ly ll 


pr p p pr pr 4- a 4" pr a 

yooyy l ct l y ct 


atptp/ptrrrrp 
aLOLyoLyyo 


rrpf^trr^trrpp 
yya Lya Lyoo 


tat at appR t 

c y i_ y y v_> v_- cj. _ . 


taarraraai" 

C Ci Ci C> V_v Ci C> Ci Ci C- 


4 680 


r~r f~r 4- p p a pf pra p 

y y Looctyycto 


pr pi 4p i3 pi 4- r^PPP 

yoLctoLyoyo 


atptp;pTat-pr4- 
aLOLyyaLy l 


trrrrtr'^rrp^rr 
Ly y Loayoay 


paaptaaapc 

y Ci V^- C Ci Ci C-r v ^ 


t gacgat ccc 


4740 


4— y — i i ptpy pi -a 4— 4~ 4- 

LodyyoctL l l 


a "1" rr a n"h a a 1" p 
ci Lycty Lad. LL 


PTPPrpPPPT't"" pf PT 

y oy oy oy Ly y 


L L a L a L LOOL 


nptaaattat 

> — ' i_ y a y i — i_ cj. i_ 


aacratpppaa 

Cj ^ Ci- C- C^ C^ ' CH Cj 


4800 


4- o 4- 4" a a 4" pr p » — ■ 
tat LaatyLL 


pr pra "H "H pr p ~H pi is 

yyctL LyoLoct 


p f 4* a f a a "h f"l" 
aLLa.LaaL.LL. 


papTpprrraaa4- 
o a y y y a a a l. 


a at at p cp a p 

a y \— y ^ — cj. v_* cj. v-^ t- *. 


at eggattgg 


48 60 


s~r r-t /~r 4— a a p a t~r r~* 

go y Lctctoctyo 


pa4 — 4--a : 4—/-rpiia4- 
La LLa Ly La L 


a 4 — H'Haaapp'i - 
a L L LaaaLL L 


apapTapri-rrrTrT 
aoayay l y y y 


tt"aaa1*ai"ta 

l Ci CJ. CJ- I — CJ. 1 — C V^j 


atacataaca 
y '-y^y yy^y 


4920 


4-4-4/- a rrvprra p 

T_LLcioyoycio 


aai^appappt" 
a ct L Ct O Oct O O L 


PTPta pt' 4- 4- a t a ^ 
y y ct y LL.ctL.cta 


o a y Layoyao 


^a^tpr^tpaa 

a y a *_ w a. i— v_> cj. 


ataacaaaaa 

C- CL O* CA Ci Ci Ci Ci 


4980 


H. cl ct ct l y y octy 


pa 4- a f" pa a "h a 
Oct La LOcta La 


pp'h*pfprp"H'l _ Pf^ 

ooLyyoL. l y a 


dCCTP HP CPj"\~ P 
y v_. y a y a cj. l. a 


at p ppattac 

Ci C Ci V_> C* C- C^L \_> 


gtt cccggct 


5040 


gacgctgggt 


gatggttata 


etcagggega 


tattttcgat 


ggtattaact 


ttcgcggcgc 


olOU 


acaattggee 


tcagatgaca 


atatgttacc 


cgatagtcaa 


agaggatttg 


ccccggtgat 


5160 


ecaeggtatt 


gctcgtggta 


ctgcacaggt 


cactattaaa 


caaaatgggt 


atgacattta 


5220 


taatagtacg 


gtgccaccgg 


ggecttttae 


catcaacgat 


atetatgecg 


caggtaatag 


5280 


tggtgacttg 


caggtaacga 


tcaaagaggc 


tgaeggcage 


aegcagattt 


ttaccgtacc 


5340 
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ctattcgtca 


gtcccgcttt 


tgcaacgtga 


agggcatact 


cgttattcea 


ttaeggcagg 


5400 


a ga a t a. c c g t 


agtggaaatg 


cgcagcagga 


aaaaacccgc 


tttttccaga 


gtacattact 


5460 


ccacggcctt 


ccggctggct 


ggacaatata 


tacrtcraaaca 


caactggegg 


ategttateg 


5520 


tcrcttttaat 

' ^3 V-* *— ^— CJ. CJ- C- 


t t eggtateg 


ggaaaaacat 


gggggcactg 


ggcgctctgt 


ctgtggatat 


5580 


u3 CJ C-* CJ. CJ CJ 


Ci Ci C w C^ Ci C*» CJ. C*/ 


tt cccgat ga 


cagt cage at 


gaeggacaat 


cggtgcgttt 


5640 


tcfnt'a't'aan 

* — w 1 — C^ C Cl C CI CL w 


aaategctea 


at gaat cagg 


cacgaat att 


cagt tag tgg 


gttaccgtta 


5700 


ttraacracfc 

v— I— > — y LA Cl y V 


ggat at ttt a 


at tt cgctga 


t acaacat ac 


agt cgaatga 


atggctacaa 


5760 


cat Irfaasra 

*w L^L 1 — C y Cl CI CI ^ Cl 


C^> CJ- CJ CJ. v ^-' CJ VJ W CJ 


ttattcaggt 


t aagccgaaa 


ttcaccgact 


att acaacct 


5820 


capttataap 

w y C C CI C CI CI V-' 


saacapaaaa 

CJ- CJ. Ci C^ ^ C^» Cj CJ y Ci 


aattacaact 


caccgttact 


cag caac teg 


ggcgcacatc 


5880 


CJ. CJ. V^/ O. C-* l^- CJ C- Ci C- 


ttaaataata 

c cyciy cyy ccl 


gecat caaac 


ttattgggga 


acgagtaatg 


tcgatgagca 


5940 


attppaaapt 

Cl I— L. ^ C* cl y y > — > C 


aaattaaata 

y y Cl C 1 — CL CL CL C CL 


ct crccftt ccra 

v-» c y C^ l_ C- c>* ^ CJ- 


agat at caac 


t ggaegctea 


get at agect 


6000 


y ci \w> y cia. ci ci ci c> 


crpotaacaaa 

y cyy * — ■ <~< cl cl 


aacrcraccrcrQa 

CJ ci y ^ c^t y C^ CJ- 


t cagatgt t a 


gegett aacg 


tcaatattcc 


6060 


tttr'aar'pap 

i_ i— L- c, ci y v_< ^ d 


t rrcrpt rrprrt t 

cyy \^ c y w y c c 


ctgacagt a a 


atctcaatcrcT 

CJ- C* \s C C> Cl ^ C- C^ CJ 


cgacatgcca 


gtg ccagct a 


6120 


rarrpaffilTa 

v — - Cl y CL y L . Ci 


raraa'bct'ra 


aeggt eggat 


gaccaatctg 


actaatatat 

y\_,i_yy »— y >— i-l 


aeggtaegtt ' 


6180 


aptcraaaaap 

y * uy y cl cl y ci 


CJ- CJ. v-^ CJ. CJ. O U^. 


getatagegt 


gcaaacccjgc 


t atgccgggg 


aaaacaataa 

y u.y y oy u i—yy 


6240 


aaataapaaa 

CL CL Cl L— CL y y V-j CL 


acrt acaacrct 

CJ, y CJ, C> CJ. ^J 'wj C^ 


acgccacgct 


gaat tat cgc 


ggtggttacg 


geaatgecaa 


6300 


tatpaattac 

1 — CL C y y 1 — <— CL C < 


aacoatacrccr 

CJ. CJ C^ CJ. C- CJ. CJ CJ 


at gat at t a a 


gcagctctat 


taeggagtea 


crcacrtcraacrt 


6360 


act" aact cat 


gccaatggcg 


t aacgctggg 


geagcegtta 


aacgataegg 


taatacttat 


6420 


taaaapappt 

C CL CL CL y C y V— ' C^ C 


era cctp a aa a cr 

CJ Vwj C>> CJ C/ CJ. CJ- CJ. CJ. y 


at gcaaaagt 


cgaaaaccag 


acgggggtgc 


gtaccgactg 


6480 


crcTft' cfcif~ t a t~ 

y *_*y c y y LL.au 


nr 1 cert ncf~ cic 

V-J N*h* L— CJ C^ C. CJ C^ 


ct t atgecac 


tgaatat egg 


gaaaatagag 


taaccfctaaa 


6540 


farcaafacc 

C- CJ, C-^ >w CJ- CJ. O Ci V_y 


ptcrcrctcra i~a 

C-* l — CJ CJ v_*» C- ^ CJ. C- CJ. 


aegt cgat tt 


agataacgeg 


gttgctaacg 


ttgttcccac 


6600 




at cert cfccracr 

Ci C C^ CJ L— CJ C* V-J CJ. CJ 


pacracrtttaa 

C^> CJ. CJ CJ- CJ I— C- C CJ, CJ- 


agegegegtt 


gggat aaaac 


tgetcatgae 


6660 


y \^ k y cl c c^ cl c* 


CJ. CJ. 1— CJ. CJ. C CJ- CJ- ^wj C-^ 


pcrpi"cfppcrtt 

C-* C CJ C^ C-^ CJ C C 


taaaaccrata 

cyyyycycicy 


gt gacatcag 


agagtageca 


6720 


era at acre acre 

y cl y * — 1 — L y y y v^r 


at t gtt gegg 


ataatggtca 


ggtttacctc 


ageggaatge 


etttageggg 


6780 


^aaaattpaa 

ci cl ci cl y c cv_»ciy 


ntcra aatcrccf 

y c y ci ci ci c y y y 


era na a na craa 

y cl y ci cl y cl y y cl 


aaat get cac 


tgtgt cgeca 


attatcaact 


6840 


y v-' cl \s ci y cl y 


a rrt pa cscfi ar 

cl y Lvdy^cty u 


an"M"attaac 

cl y c C cl C c cl i 


ccagct at ca 


get gaat gtc 


qttaacfQaaa 


6900 


r^rrtaataaa^ 
uy L,y a ay a 


aapaaapptt 

CL Cl C» CL Cl CL \_s C C 


tttat pt tct 

c c c cl L— w c c v c 


at apCTct ttt 

CJ C-* CJ C> C- 1— c^ c- 


tt at aac taa 

c c y cyyc^i— yy 


egg t gag tea 


6960 


cgctttggct 


geggatagea 


cgattactat 


ccgcggctat 


gtcagggata 


aeggctgtag 


7020 


tgtggccgct 


gaatcaacca 


attttactgt 


tgatctgatg 


gaaaacgegg 


cgaagcaatt 


7080 


taacaacatt 


ggegegaega 


ctcctgttgt 


tecatttegt 


attttgetgt 


caccctgtgg 


7140 


taatgccgtt 


tetgeegtaa 


aggttgggtt 


tactggcgtt 


geagatagee 


acaatgecaa 


7200 


cctgcttgca 


cttgaaaata 


cggtgtcagc 


ggcttcggga 


ctgggaatac 


agcttctgaa 


7260 
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tgagcagcaa 


aatcaaatac 


ccct taatgc 


tccat egtec 


gcgctttcgt 


crcraccrapppt 


7320 


era cere ccr crcrt 


aaaccaaata 


cget gaat tt 


ttaccrccccfCT 


etaatggega 


^— ^ C-i Ci C^J C^ C. C<j C> Cv 


7380 


t crt cactaca 


aaacatatca 

y y y L* Cl U UL U Cl 


a t gccacggc 


tacctt cact 


cttgaatatc 


aQtaaptpera 

y u d d d uy y d 


7440 


cratcrctpatcr 


a 3 3 1 crcrt err 1 a 

d d d U y y d 


a a cert cr crcrt a 


t crt at t crcrpcr 


gcaatattgg 


CCTPtpcrpaacr 

d y d i_ d y d ct ay 


7500 


C \-A CJ. C^ Vwi Ci I— Ci 


pa rrerpaere t r i ef 

lclu y l> ci y d y 


atrfirarral" 

I— I— C-rt \w* LJ. O 


Pr^pererterr^riP 

Vw> CL d* d U LU Ci Ci d 1 


ggt aaggt eg 


'[~.CCICCRRRC , ^ , 
' — d y d da a add 


7560 


Qt crt" 3 err crt - t~ 

^3 c ^ c» Ci y Cj l— i— 


tpPaPPaPPa 
U U* * — ' Cl V_> U* Ct d» Ci 


a tgccacggt 


t gat ct egge 


gatctt tatt 


c 1 1 1 pa ert pt 

V-^ C C- C "d-» Ci CJ C O C- 


7 620 


tat" crt ctcrcc 

v — *— *- U v-j U L* U y U* V 


crcrercrccfcrca t 

y y y V—-* y C-4. C 


ccfgcctggca 


t gatgtt gcg 


ctt gagtt ga 


ctaattptpp 

d- L— d. d c u y C d o 


7 680 


crcrt crcrcf a a per 


t p era per crt 

Lwy uy y y l d a 


pterccacrptt 


d a. y dy y y y l ci 


erp per 3 pa crt a 

y d d y d d d y i_ d 


p ppera t a 1 1~ a 

d dy y a l a l l a 


7740 


aQa.a.Qooa.y 


y y y auuy L»y l 


ci a. a. ci • a. <— ^ cj. 


erttaer3erp1"3 
y l. Lay ay l ua 


p^ereratcrapa 
dayya l y a d a 


crt erere i aa e , ae i 
y Lyydaadad 


7800 


O- L Ly CICl LaU L 


yy uy L/Qul-lq 


aaaoay u toa 


rrrrt" rrera t~ era 1~ 
y yj ^yyaLya l. 


Ldd ULCLLUU L 


r , a ere , err , a e* - ! - ^ 
daydydadL l 


78 60 


e* rr~* rr "T *f~ ;a pan 
d d d y l l a d a y 


y LLay ay L-a l 


L.y cLv_^ci.y L.cicLd 


y y yy y 


arlrarfrrrfaa 
ad uLayyyGa 


e 1 e*a +~ I - c r rrrrc 
dda l uLciyyL 




ay Ly ci l Lay l< 


a 1" rr'ha "ha 


ppj-apar7r , i _ rT 


a a ppprra a aa 
aadddy aaya 


rrR~\~rtR : {~^rf'\~R 
yaLyaLLy La 


R^rrtRRRfTTRn 

a uy auaLy ay 


7 980 


ttattaccct 

1 — ' — a 1— 1— CI d d u, u 


crt ttcrptcrta 


pterptcfr^terer 


erpt crcrt p crcrt 

d d L d y t — d d d C 


aa a farrfacr 

Ci Ci Ci l y d d i — y y 


tpattppppt 

L L* Ci L L d y d d U 


8040 


crt aaBarprrr 

y L.uuciaL/LiyL' 


^~ , aat^f^^+~p^^ , ^ , 

ci a i_ y y u. a. > — ■ 


PPtatPPPtr5 


1 1 ererperer't - erer 

c uy y Ly y uy y 


cr cfccccR at 

day d y d d d d 


crtttatcrtaa 

y u u u a u y u a a 


8100 


acctterpcrpp 

Ci V — - l_ I — . y U' V^-j O d 


pert pert era at 


crtcrcrcrcrcaaa 

y d d d d d d. d. d. 


acctcrcftccrt 

d d U* C- d d C- d y c 


crcratptttcQ 

y y ci c \— • c u. y 


accrcaaatpt 

d. U* y d Ci Ci Ci U d u 


8160 


tttcfccataa 

C I — ■ U y U' < — i. 1 — Ui 


ccrattatcccr 


gaaaccatt a 


cagact atgt 


cacactgcaa 


egaggctegg 


8220 


pttatcrcrpcrcr 

d u i — d u y y dy y 


pert crt t r t p t 
y y 


aatttttPPQ 


Cfcrappcrtaas 

y V-j Ci L- CJ. CJ. 


atataertcrcfc 

d u. d i — d y y y d 


actacrptatp 

d y u d y d u d u d 


8280 


rattlrfhar 

d a ' — - i — - i — - d d u a d 


uauuay o y a.ci 


a pcrppcrperpcr 

d d y d d y d y d y 


ttatttataa 

LI— y LLLdLd d 


ttperaeraapcr 

l l d y a y d cl d y 


era t a a pp pert" 

y d i — d d y d dy u 


8340 


cr erp per crt cr rr p 

yy o^yy uy y d 


actttattta 


r p erect crt era 


apacrt apcraa 


p per crcrt aaca 

dy y y y l y y dy 


at taaacTPtcr 

Ci C C Ci Ci Ci Cj c* C Cj 


8400 


rfrt"rat"r*aaf 


tcrppcrtcfptt 


attttcrper^p 


serflpp^ripaa 

Ci CJ Ci Ci Ci 'w Ci Ci 


ptataacacrp 

V--" C Ci C Ci Ci ^ Ci \^ 


patpatttpp 

y d. u y d u u u d d 


8 4 60 


a crt 1 1 crt at a 

Ci Cj C- C- C y C ^ I — \-4 


craatatttac 


gecaataatg 


atcrtcrcrtcrcrt 


gcctactggc 


QQptcrcpatcr 

yyduydya uy 


8520 


tttctcrctccr 

C* C- c c** c* ^ V— * C \_> y 


tcratcrtcacc 


crttactctcrc 


pcrcrapt appp 

d y y cc »w c d u,/ d d 


t pcrtt cacrt cr 

L— ^ C- C- Ci >dj C- v^j 


ccaattpptp 

U^ U< Ci Ci U U d d U U- 


8580 


ttapperttta 

u i_ d d y u u a 


tterterpcraaa 

l i,y uy y cj. c-i. 


d y d d d d d d d d 


t ererercf t a 1 1 r 

l y y y y l a l l a 


pptptcccrcrc 

V_y C* C^ C- V-> C* C< 


d d d d d d y d a y 


8 640 


a"t~crppcrppaa 

a Lydyyydaa 


ptpcrattttp 


d d d d d U- d d d y 


eTTi'r'rf'h"}'"! - '!"^ 

dy Ldy LLLLd 


apptppacacr 

a d d l y d d d d y 


crerpefhprrere'pr 
yy^—y ^^yy^y 


87 00 


Lauay i_ L.y aL. 


rrprrpa a /~* rrerf~ 
yL-yuaaoyy i_ 


dv. y a L La L LL> 


parrprraat"aa 
ddLjdyaa l a a 


r~ , aif" , nTft"a1 - e , rT 
LaLyy La Ldy 


l uay yayLay 


O / D U 




rfrr(~ , rrer'i~ef3 ert 
yyuyy uyciy l. 


d i_ y y y a i_ l, cl cl 


Ly y \ — a a a. l La 


t erp a pert a CP 
l y d a v — y Lddd 


ppacfcrppa erer 

yy^yyy^^yy 




■\- Cfpi pf" npa errr 
i y a d u y d a y y 


efaateftefpaa 

y aa l. y i_ y ^ a. 


te i rfr?ttatter 

i d y d L- d L Ly 


crperter^pttt 

y d y l y a d l l l 


tert 1 1 atcaa 

l_ CJ C- C- C- Ci C V--' Ci Ci 


taatptapaa 

1 — d d u d u a y a a 


8880 


ggatccccgg 


gtaccgagct 


cgaattcact 


ggccgtcgtt 


ttacaacgtc 


gtgactggga 


8940 


aaaccctggc 


gttacccaac 


ttaatcgect 


tgcagcacat 


ccccctttcg 


ccagctggcg 


9000 


taatagcgaa 


gaggcccgca 


ccgatcgccc 


ttcccaacag 


ttgcgcagcc 


tgaatggcga 


9060 


atggcgcctg 


atgeggtatt 


ttctccttac 


gcatctgtgc 


ggtatttcac 


acegcatatg 


9120 


gtgcactctc 


agtacaatct 


getctgatge 


cgcatagtta 


agccagcccc 


gacacccgcc 


9180 
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aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc 9240 
tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcg 92 9 9 

<210> 169 

<211> 8464 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pFIMAICDFG 



<400> 169 
cgagacgaaa 


gggcctcgtg 


atacgcctat 


t. tit. taraggti 


t aatgtcatg 


at aataatgg 


bu 


tttcttagac 


gtcaggtggc 


acttttcggg 


gaaatgtgcg 


cggaa cccct 


_ 4- 4- 4- ^4- 4- 4 , 4- 

atrtgti c rati 


ion 


ttttctaaat 


acattcaaat 


atgtatccgc 


tcatgagaca 


at aaccctga 


t aaatgcttc 


i on 
loU 


aataatattg 


aaaaaggaag 


agtatgagta 


tt caacattt 


ccgt gtcgcc 


cttattccct 


9/i n 

Z 4 U 


tttttgcggc 


attttgcctt 


cctgx r mz rg 


ctcacccaga 


aacgctggtg 


aaagtaaaag 


JUL) 


atgctgaaga 


tcagttgggt 


gcacgagtgg 


gttacatcga 


actggatctc 


aacagcggta 


O C C\ 


agatccttga 


gagttttcgc 


cccgaagaac 


gt t t t ccaa t 


gatgagcact 


t ctaaagt tc 


/ion 
4 Z U 


tgctatgtgg 


cgcggtatta 


t cccgtattg 


acgccgggca 


agagcaact c 


ggt cgccgca 


a q n 
4oU 


tacactattc 


tcagaatgac 


ttggttgagt 


actcaccagt 


cacagaaaag 


catcttacgg 


540 


atggcatgac 


agtaagagaa 


ttatgcagtg 


ctgccataac 


catgagtgat 


aacactgcgg 


600 


ccaacttact 


tctgacaacg 


atcggaggac 


cgaaggagct 


aaccgctttt 


ttgcacaaca 


660 


tgggggatca 


tgtaactcgc 


cttgatcgtt 


gggaaccgga 


gctgaatgaa 


gccataccaa 


720 


acgacgagcg 


tgacaccacg 
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cla.CclCL.yT_.cl L. 


4-4-fra pt4"" pf pr4- ai 

L Lydy Ly y l cl 


PTP'P'ai'hp'aiasaiP' 
yLLcL LLaaaL 


f-4--3 4- 4- prpTPTPTaq 
l l a l Lyyyya 


ppaap/t"3at"ff 

aLyay i— aai_y 


t"rrr^i _ n'r^rrp , 3 

l Ly a l y ay l a 


5940 


d LLLLayy CL 


/^r pf ra4— 4p:a.raaiT~ai 
y y a L LadaLa 


P* , "f~PfP , pf'T'f~P >, PTaa 

l l y l y l LLya 


aiPTai*ra54-paia : ?P 1 
aya La l l a a l 


i~nrTP\c , Cfr'i~c , P\ 

Lyy a l y i_ a 


nC^'R'f'PiCICC'f' 
y L« l a l a y l« l> l 


6000 


PT TT] /~i /T t3 ^ Q 3 3 / — i 

yaLLj aaaaaL 


/-t ri p 1" repp a a a 

y LLLyyLddd 


as ai ptprai p^prpfpfaa 
aayyaLyyya 


4ppaipra!4 _ pr'f - 4--ai 
l Lay a Ly l lcl 


pppp"hi"3app 
y l y l l LaaLy 


"hr*P43"ra'i"iT'P 1 

L L a a l a ' — l l l. 


6060 


4~ 4~ 4~ P ai /-r p p ai P 
L L LLay LLaL 


t~ /'tptpt~p/pp/'T~ 
LyyLLyLy ll 


l Lya Lay l a a 


aq+"p<4-p'airi"4-rtPT 
a l l LLay Lyy 


pfi^pa'hrfppa 

uy a w a i_ y a 


nf~ClCCPi CfCf" Pi 
y l y l» v_r ci y l a 


6120 


/"< -3 /~r p -3 4~ Pf 4- p Cl 

Lay La Ly LLa 


p« a prra ■)" pf pa 
L CL L y Cl LL LLa 


Pi pi pr pr f~ crtcfp\'\~ 
a l y y Luy y a l 


PTa^ppaiai"} - p , "hPT 
yaLLaa ll Ly 


pf pf- pf pr 4- rr+- a 4- 
Lyy i— y i_ai_- 


p\ PTTprt" a p pft" i~ 

cj. l- y ^ La l^ y l l 


6180 


y l Ly y aay aL 


aaaapaiaiPPT~pas. 
aaLaaLL LLa 


PT P 1 "h ai 4- ai prp^ PT*f~ 

y l La Lay Ly l 


ncpi Pi p\ p p 1 ptpt p 1 
y LaaaLLyy l 


4- ai 4- pf p ppf pr pr pf 
La LyLLyyyy 


nPi n a C a R f~ an 
y a y y Ly a Lyy 


6240 


n 2 ■] 2 /~r p /~f /-v aa 

addLdgLtjCjd 


ciy LaLctyyLL 


a ppppa r~" pr p 1 4~ 
Ci L y LLdLy L L 


Praiaj4-4- i p54-pipYP-' 
yaa l La LLy l 


pr Pf4p pr pr4- 4p as ppr 

yy Lyy l l a l y 


Pfpaia54-pfp'p'aai 
y L a a l y l l a a 


6300 


4- ^ 4- p pt /t 4— 4" ai P 
LdLLyy L LdL 


1 /~f /-I /—i ~3 4~ O PY P P" 

dy LCd LciyLy 


a54~prai4 - a5"l - 4 - a3a) 
a Ly a La L Laa 


prp , aiPTp , 'i~p , ^ - ai"h 
y l a y l lllcil 


f a prrpa rrr pa 
l a l y y a y LLa 


prpppf 4~ prprpfpr'} - 
y <-y y Ly y y y l 


63 60 


aCL^yCULa L 


yLLcXaLgyLy 


l a ct l y l Lyyy 


ptp' ai ptp* p~* rr^~ +- Pi 
y Lay LLy L La 


aiaippTai4-aipPTPf 
aa l y a l a l y y 


+"rrpfi~pTP , 'V"'rpr , r 
Lyy LyLLLy l 


6420 


4- aa aa -a pr p pt p p 4~ 
Ldaay LyLL L 


rrPrr^PTP , aa a aa. aa pr 
yyLyLadaay 


ai 4" ptp* ai ai ai a^i n"t~ 
a LyLactaay l 


(TfP)PiPiPi(T , Pir{ 
v — y a a a a La y 


ai p prpf pr pf PT"h pr p 

dL> yy yy y y t — 


pphr^ c CClPi c~\~ n 

y l a l- Ly a l> l y 


6480 


/^pp4ppfpr4p-f-;=>4-- 

(JLy Ly y LLaL 


pfP , ppTT~pfp't~prp 
y LLy Ly LLy l 


pi4--ha)4-PTP , r , ?4f~' 
L L La LyLLaL 


i-Qa3af-a54-p-«pYpT 


pf ai aaaf^ prta pf 
y a a a a i_ a y ay 


1~pj'CfprTPt"n'n'3 

Lyy l» y ^ — • l y y a 


554 0 


LdLLaCl L a L L 


P"f~PfPYPt"'PTai,'r~ai 
l Ly yL l y a l a 


ai P" | PT't~P , PT3't~"h"r 
a.Ly l l y a l l i— 


pi n r 1r_p) pi pprpTr 

y a i — L^i. a. * — * y v^. y 


"h't _ P'"ha3a)p 
y l l y l l a a l- y 


■hi~rr1""hppp3P 

L l y * L ^- — ' L» l* a L 


6600 


4-ppi"T"PTPTPfPfP' , PT 

Hey Ly y y y Ly 


aa 4- p pf4p pr p praa pf 
a LLy L-y Ly ay 


p , aiPTa5pf'} - i~4 - a^^ 
Lay ay l l Laa 


Pinc'CicciC'Cfi'i' 

a y l y ^ y v_> y i_ 


PTPrPfaa4-aja5aia3p 
y y y a i— a a a a l 


'hprr , i - p , r : 5i~PTa3p 
l y l LLa l y a l 


6660 


yC LyaLLLdL 


a a ■)- -i a "f" a a rrp 
act Ldd Lddy L 


p^ptp , 4~ptp > 'p , pt4 _ 4/- 

LyLLyLLy l l 


■hrrPTPTPfp , pra»4-pt 

L yyyy u y d L y 


pf 4- pr paj4-p-apf 

y LyaLaLLay 


ai prai pt" apppa 
ay ay Lay LLa 


U / 


gay Lay Ly y l 


aa4~4-pf4p4ppfppfpf 
a l Ly l LyLyy 


Pi +~ Pi Pt +~ prpr4~ p" 1 ai 
cl Lclct Ly y LLa 


y y i— ■ a i — > <_ k^. 


a>PfpprPTai=»-hpTp 
ay Ly y aa Ly i — 


P+" +" "t" Pi PTPPTPf Pf 
ll l l a y l y y y 


67 8 0 


aaaagttcag 


gtgaaatggg 


gagaagagga 


aaatgctcac 


tgtgtcgcca 


attatcaact 


6840 


gccaccagag 


agtcagcagc 


agttattaac 


ccagctatca 


gctgaatgtc 


gttaaggggg 


6900 


cgtgatgaga 


aacaaacctt 


tttatcttct 


gtgcgctttt 


ttgtggctgg 


cggtgagtca 


6960 


cgctttggct 


gcggatagca 


cgattactat 


ccgcggctat 


gtcagggata 


acggctgtag 


7020 


tgtggccgct 


gaatcaacca 


attttactgt 


tgatctgatg 


gaaaacgcgg 


cgaagcaatt 


7080 
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taacaacatt 


qqcqcqacqa 


ctcctgttgt 


tccatttcgt 


attttgctgt 


caccctgtgg 


7140 


taatgccgtt 


tctgccgtaa 


aggttgggtt 

Z) ZJ -J ~) —I 


tactggcgtt 


gcagatagcc 


acaatgccaa 


7200 


cctgcttgca 


cttgaaaata 


cggtgtcagc 

Z) Z) 3 — ' 


ggcttcggga 


ctgggaatac 


agcttctgaa 


7260 


tgagcagcaa 


aatcaaatac 


cccttaatgc 


tccatcgtcc 


gcgctttcgt 


ggacgaccct 


7320 


qacgccaqgt 


aaaccaaata 


cgctgaattt 


ttacgcccgg 


ctaatggcga 


cacaggtgcc 


7380 


tgtcactgcg 


gggcatatca 


atgccacggc 


taccttcact 


cttgaatatc 


agtaactgga 


7440 


gatgctcatg 


aaatggtgca 


aacgtgggta 


tgtattqqcq 

ZJ -J ZJ ZJ 


gcaatattgg 


cgctcgcaag 


7500 


tgcgacgata 


caggcagccg 


atgtcaccat 


cacggtgaac 


ggtaaggtcg 


tcgccaaacc 


7560 


gtgtacggtt 


tccaccacca 


atgccacggt 


tgatctcggc 


gatctttatt 


ctttcagtct 


7620 


tatgt ctgcc 


aacraccracat 


cggcctggca 


taatattqcq 


cttgagttga 


ctaattgtcc 


7680 


aataaaaaca 


t cgagggtca 


ctgccagctt 


caacaagqca 


gccgacagta 


ccggatatta 


7740 


taaaaaccag 


QQQB.OCQCQC 


aaaacatcca 


gttagagcta 


caggatgaca 


gtggcaacac 


7800 


attgaatact 


ggcgcaacca 


aaacagttca 


ggtggatgat 


tcctcacaat 


cagcgcactt 


7860 


cccgt tacag 


gtcagagcat 


tgacagtaaa 


tggcggagcc 


actcagggaa 


ccattcaggc 


7920 


agtgattagc 


atcacctata 


ectacagctg 


aacccgaaga 


gatgattgta 


atgaaacgag 


7980 


ttattaccct 


gtttgctgta 


ctgctgatgg 


gctggtcggt 


aaatgcctgg 


tcattcgcct 


8040 


gtaaaaccgc 


caatggtacc 


gagctcgaat 


tcactggccg 


tcgttttaca 


acgtcgtgac 


8100 


tgggaaaacc 


ctggcgttac 


ccaacttaat 


cgccttgcag 


cacatccccc 


tttcgccagc 


8160 


tggcgtaata 


gcgaagaggc 


ccgcaccgat 


cgcccttccc 


aacagttgcg 


cagcctgaat 


8220 


ggcgaatggc 


gcctgatgcg 


gtattttctc 


cttacgcatc 


tgtgcggtat 


ttcacaccgc 


8280 


atatggtgca 


ctctcagtac 


aatctgctct 


gatgccgcat 


agttaagcca 


gccccgacac 


8340 


ccgccaacac 


ccgctgacgc 


gccctgacgg gcttgtctgc 


tcccggcatc 


cgcttacaga 


8400 


caagctgtga 


ccgtctccgg 


gagctgcatg 


tgtcagaggt 


tttcaccgtc 


atcaccgaaa 


8460 


cgcg 












8464 



<210> 170 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic M2 Peptide 
<400> 170 

Ser Leu Leu Thr Glu Val Glu Thr Pro He Arg Asn Glu Trp Gly Cys 
15 10 15 

Arg Cys Asn Gly Ser Ser Asp Gly Gly Gly Cys 
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20 25 



<210> 171 
<211> 97 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Matrix protein M2 
<400> 171 

Met Ser Leu Leu Thr Glu Val Glu Thr Pro He Arg Asn Glu Trp Gly 
15 10 15 

Cys Arg Cys Asn Gly Ser Ser Asp Pro Leu Ala He Ala Ala Asn He 
20 " 25 30 

He Gly He Leu His Leu He Leu Trp He Leu Asp Arg Leu Phe Phe 
35 40 ■ 45 

Lys Cys He Tyr Arg Arg Phe Lys Tyr Gly Leu Lys Gly Gly Pro Ser 
50 55 60 

Thr Glu Gly Val Pro Lys Ser Met Arg Glu Glu Tyr Arg Lys Glu Gin 
65 70 75 80 

Gin Ser Ala Val Asp Ala Asp Asp Gly His Phe Val Ser lie Glu Leu 
85 90 95 

Glu 



<210> 172 

<211> 770 

<212> PRT 

<213> Homo Sapiens 

<400> 172 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 • 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 " " 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 
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Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

lie Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met lie 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
325 330 ' 335 

Cys Met Ala Val Cys Gly Ser Ala Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 

Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 
370 375 380 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 
405 410 415 

Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He 
420 425 430 
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Gin His Phe Gin 
435 

Glu Arg Gin Gin 
450 

Leu Asn Asp Arg 
465 

Gin Ala Val Pro 



Tyr Val Arg Ala 
500 

Glu His Val Arg 
515 

Gin Val Met Thr 
530 

Leu Ser Leu Leu 
545 

Glu Val Asp Glu 



Leu Ala Asn Met 
580 

Leu Met Pro Ser 
595 

Val Asn Gly Glu 
610 

Gly Ala Asp Ser 
625 

Asp Ala Arg Pro 



Gly Leu Thr Asn 
660 

Ala Glu Phe Arg 
675 

Val Phe Phe Ala 
690 

Leu Met Val Gly 
705 

Val Met Leu Lys 



Glu Val Asp Ala 
740 

Gin Gin Asn Gly 
755 



Glu Lys 



Leu Val 

Arg Arg 
470 

Pro Arg 
485 

Glu Gin 

Met Val 

His Leu 

Tyr Asn 
550 

Leu Leu 
565 

lie Ser 

Leu Thr 

Phe Ser 

Val Pro 
630 

Ala Ala 
645 

lie Lys 

His Asp 

Glu Asp 

Gly Val 
710 

Lys Lys 
725 

Ala Val 
Tyr Glu 



Val Glu 
440 

Glu Thr 
455 

Leu Ala 



Pro Arg 



Lys Asp 



Asp Pro 
520 

Arg Val 
535 

Val Pro 



Gin Lys 



Glu Pro 



Glu Thr 
600 

Leu Asp 
615 

Ala Asn 

Asp Arg 

Thr Glu 

Ser Gly 
680 

Val Gly 
695 

Val He 

Gin Tyr 

Thr Pro 

Asn Pro 
760 



-93- 
Ser Leu 

His Met 

Leu Glu 

His Val 
490 

Arg Gin 
505 

Lys Lys 

He Tyr 

Ala Val 

Glu Gin 
570 

Arg He 
585 

Lys Thr 

Asp Leu 

Thr Glu 

Gly Leu 
650 

Glu He 
665 

Tyr Glu 

Ser Asn 

Ala Thr 

Thr Ser 
730 

Glu Glu 
745 

Thr Tyr 



Glu Gin 



Ala Arg 
460 

Asn Tyr 
475 

Phe Asn 

His Thr 

Ala Ala 

Glu Arg 
540 

Ala Glu 

555 

Asn Tyr 

Ser Tyr 

Thr Val 

Gin Pro 
620 

Asn Glu 
635 

Thr Thr 
Ser Glu 



Val His 



Lys Gly 
700 

Val He 
715 

He His 



Arg His 



Lys Phe 



Glu Ala 
445 

Val Glu 
He Thr 
Met Leu 



Leu Lys 
510 



Gin He 
525 



Met Asn 



Glu He 

Ser Asp 

i 

Gly Asn 
590 

Glu Leu 
605 

Trp His 



Val Glu 



Arg Pro 



Val Lys 
670 

His Gin 
685 



Ala He 



Val He 



His Gly 



Leu Ser 
750 

Phe Glu 
765 



Ala Asn 

Ala Met 

Ala Leu 
480 

Lys Lys 
4 95 

His Phe 

Arg Ser 

Gin Ser 

Gin Asp 
560 

Asp Val 
575 

Asp Ala 
Leu Pro 



Ser Phe 

Pro Val 
640 

Gly Ser 
655 

Met Asp 

Lys Leu 

He Gly 

Thr Leu 
720 

Val Val 
735 

Lys Met 
Gin Met 
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Gln Asn 
770 



<210> 173 

<211> 82 

<212> PRT 

<213> Homo Sapiens 

<400> 173 

Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys 
1 ~ 5 10 15 

Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin 
20 25 30 

Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He 
35 40 45 

He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val He He He 
50 55 60 

Thr Leu Val Met Leu Lys Lys Gin Tyr Thr Ser Asn His His Gly Val 
65 70 75 80 

Val Glu 



<210> 174 

<211> 42 

<212> PRT 

<213> Unknown 
<220> 

<223> Amyloid Beta Peptide 



<400> 174 

Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys 
15 10 15 

Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He 
20 X 25 30 

Gly Leu Met Val Gly Gly Val Val He Ala 
35 40 



<210> 175 
<211> 12 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> p33 peptide 
<400> 175 

Cys Gly Gly Lys Ala Val Tyr Asn Phe Ala Thr Met 
1 5 10 

<210> 176 

<211> 37 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> DP178c peptide 
<400> 176 

Cys Tyr Thr Ser Leu lie His Ser Leu lie Glu Glu Ser Gin Asn Gin 
15 10 15 

Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser 
20 25 30 

Leu Trp Asn Trp Phe 
35 

<210> 177 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> c-terminal linker 
<400> 177 

Gly Ser Gly Gly Cys Gly 
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1 5 

<210> 178 

<211> 65 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GRA2 



<400> 178 

Lys Glu Ala Ala Gly Arg Gly Met Val Thr Val Gly Lys Lys Leu Ala 
1 5 10 15 

Asn Val Glu Ser Asp Arg Ser Thr Thr Thr Thr Gin Ala Pro Asp Ser 
20 " 25 30 

Pro Asn Gly Leu Ala Glu Thr Glu Val Pro Val Glu Pro Gin Gin Arg 
35 40 45 

Ala Ala His Val Pro Val Pro Asp Phe Ser Gin Gly Ser Gly Gly Cys 
50 55 60 

Gly 
65 

<210> 179 
<211> 18 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> D2 peptide 
<400> 179 

Cys Gly Gly Thr Ser Asn Gly Ser Asn Pro Ser Thr Ser Tyr Gly Phe 
15 10 15 

Ala Asn 

<210> 180 
<211> 18 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> B2 peptide 
<400> 180 

Cys Gly Gly Asp He Ser Asn Gly Tyr Gly Ala Ser Tyr Gly Asp Asn 
1 5 10 15 

Asp He 

<210> 181 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> muTNFa peptide 
<400> 181 

Cys Gly Gly Val Glu Glu Gin Leu Glu Trp Leu Ser Gin Arg 
15 10 

<210> 182 

<211> 22 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TNFa II ( 3 1 -TNFa II) 

<400> 182 

Ser Ser Gin Asn Ser Ser Asp Lys Pro Val Ala His Val Val Ala Asn 
1 5 ~ 10 15 

His Gly Val Gly Gly Cys 
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20 

<210> 183 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TNFa II (5' TNFa II) 
<400> 183 

Cys Ser Ser Gin Asn Ser Ser Asp Lys Pro Val Ala His Val Val Ala 
15 10 15 

Asn His Gly Val 
20 



<210> 184 

<211> 182 

<212> PRT 

<213> Escherichia coli 



<400> 184 

Met Lys lie Lys Thr Leu Ala lie Val Val Leu Ser Ala Leu Ser Leu 
15 10 15 

Ser Ser Thr Ala Ala Leu Ala Ala Ala Thr Thr Val Asn Gly Gly Thr 
20 25 30 

Val His Phe Lys Gly Glu Val Val Asn Ala Ala Cys Ala Val Asp Ala 
35 40 45 

Gly Ser Val Asp Gin Thr Val Gin Leu Gly Gin Val Arg Thr Ala Ser 
50 ~ 55 60 

Leu Ala Gin Glu Gly Ala Thr Ser Ser Ala Val Gly Phe Asn lie Gin 
65 70 75 80 

Leu Asn Asp Cys Asp Thr Asn Val Ala Ser Lys Ala Ala Val Ala Phe 
85 90 95 

Leu Gly Thr Ala lie Asp Ala Gly His Thr Asn Val Leu Ala Leu Gin 
100 105 110 

Ser Ser Ala Ala Gly Ser Ala Thr Asn Val Gly Val" Gin He Leu Asp 
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115 120 125 

Arg Thr Gly Ala Ala Leu Thr Leu Asp Gly Ala Thr Phe Ser Ser Glu 
130 135 140 

Thr Thr Leu Asn Asn Gly Thr Asn Thr lie Pro Phe Gin Ala Arg Tyr 
145 150 155 160 

Phe Ala Thr Gly Ala Ala Thr Pro Gly Ala Ala Asn Ala Asp Ala Thr 
165 170 175 

Phe Lys Val Gin Tyr Gin 
180 

<210> 185 
<211> 152 
<212> PRT 

<213> Hepatitis B virus 



<400> 185 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 .10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala lie Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Thr Asn Leu Glu Asp Gly Gly 
65 70 75 80 

Lys Gly Gly Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met 
85 90 95 

Gly Leu Lys lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr 
100 105 110 

Phe Gly Arg Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp 
115 12 0 125 

lie Arg Thr Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser 
130 135 140 

Thr Leu Pro Glu Thr Thr Val Val 
145 150 

<210> 186 

<211> 152 

<212> PRT 

<213> Hepatitis B virus 



WO 01/85208 



PCT/IB01/00741 



-100- 



<400> 186 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 " 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Ser 
35 " 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Thr Asn Leu Glu Asp Gly Gly 
65 70 75 80 

Lys Gly Gly Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met 
85 90 9 5 

Gly Leu Lys lie Arg Gin Leu Leu Trp Phe His lie Ser Ser Leu Thr 
100 " 105 110 

Phe Gly Arg Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp 
115 120 125 

lie Arg Thr Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser 
130 135 ~ 140 

Thr Leu Pro Glu Thr Thr Val Val 
145 150 



